NR Native PVIIA Search: 



BLASTP 2.2.4 [Aug-26-2002] 



Reference : 

Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search 
programs", Nucleic Acids Res. 25:338 9-34 02. 



RID: 1038287392-07 658-24 371 



Query= 

(27 letters) 



Database: All non-redundant GenBank CDS 
translations+PDB+SwissProt+PIR+PRF 

1,242,768 sequences; 395,571,179 total letters 



If you have any problems or questions with the results of this search 
please refer to the BLAST FAQs 



Taxonomy reports 



Distribution of 156 Blast Hits on the Query Sequence 



Sequences producing significant alignments: 


Score 
(bits) 


E 

Value 




7519866|piri IA58997 kappa-conotoxin PVIIA - cone shell (... 


98 


9e-21 




22002036|splP56633|CXK7 CONPU Kappa-conotoxin PVIIA prec... 


98 


9e-21 


gi 
gi 
gi 


3891872 Ipdbl IKCPi 3d Structure Of K-Conotoxin Pviia, A ... 
3891964 ipdbi 1AV3 i Potassium Channel Blocker Kappa Conot . . . 
1905981 1 qb iAAB50229. 1 1 Gal/GalNAc lectin heavy subunit [... 


93 
93 
33 


4e-19 
4e-19 
0.32 


gi 


17158399 1 ref 1 NP 477819.11 wsv297 [shrimp white spot synd. . . 


30 


2.5 


gi 


15895618 1 ref 1 NP 348967.11 Glycosyltransf erase [Clostridi... 


29 


8.1 


gi 


10120503 Ipdb! 1C6WI A Chain A, Maurocalcin From Scorpio Ma... 


28 


11 


gi 


19577387 1 emb 1 CAD27769. 1 1 hypothetical protein [Anopheles... 


28 


11 


gi 


21402496 i ref 1 NP 658481.11 minC, Bacterial proteins invol . . . 


28 


15 


gi 


23021387 1 qb| ZP 00061064.11 hypothetical protein [Clostri... 


28 


15 


gi 


17567271 1 ref 1 NP 510129.11 Predicted CDS, p-glycoprotein ... 


28 


15 


gi 


20454175 1 qb IAAM22197 . 1 IAF501276 1 Gal/GalNAc lectin heav. . . 


27 


19 


gi 


7494127 Ipir M T09229 galactose binding adhesin heavy chai . . . 


27 


19 
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2247 6954 j gb 1 AAMy /ooo . 1 1 bal/baiNAc lecnn neavy suDuniL ... 


97 
z / 


1 Q 
X y 


gi 


21oo4U4|pirl ib/UDDo lecrm neavy cnam, iN-aceT:yj.gaj.acLOs . . . 




1 Q 
X y 




22yc)4zoo|gD|^P uuuiiooy.ij nypot:ner.ica± pro re in l Knouops . . . 


97 


X y 


gi 


l/ODJ/oo|ireriNir du/i^'D.j.i irireaiCLeQ L/Uo^ / lx dnoiueiiijjx dut: . . . 


97 


9 

z. u 


gi 


21 / (Joyoo 1 rex 1 Nir boJ^iDU. 1 j ceramiae Kinase (_l^ius iuus cuius j . . . 


97 
z, / 


9 

Z D 


gi 


b4o / y^j 1 Spl rObyoJ lAr fUYl rvH. i /ixaiUin pieCUXSUX ^Mipxid dXJJUuiX . . . 


97 
z. / 


9 f, 
z u 


gi 


2J51U111 1 rex 1 NF /Uz / / / . i j eryxnrocyLe meiriDrane proxem i . . . 


97 


^ R 


gi 


l/oUo4/y|rex!Nir 4yj)ioo.X! iNiuciedr noxiuone Kecep lux x diux x . . . 


27 


^ R 


gi 


1 /bDyD4y 1 rex i Nr Diuizz . 1 [ rreaicuea uuci, zmc ringer . . . 


9 

Z D 


d 7 


gi 


2 3 61 2 o lo 1 rex 1 NP /U4oo/.lt nypOLnecicai prorem [ riasmoai . . . 


9 

Z D 


A 1 


gi 


2361JU/(j|rexjNr /UJjyz.lj nypornecicai prorem [rxasmoai . . . 


9 

Z D 


D O 


gi 


23123030 1 gb 1 ZP OOlUoloo.ll nypotnetical protein [Procnio... 


9 

z b 


D J 


gi 


21302332 1 qb 1 EAA14477 . 1 1 agCP8528 [Anopheles gambiae str.... 


26 


63 


qi 


21288202 1 qb 1 EAA00523 . 1 i agCP9342 [Anopheles gambiae str.... 


26 


63 


gi 


l/oJoobb|rer|NP oui/ou.x| <6inc x mger xype xixydiiu uxii. . . 


9 S 


ft R 


gi 


1 / Ubo4lZ / iqD) rirVfl X / O y 4 . X 1 M/in X/O-?^ UXirvIiUWll ^pXOLcXii XwX L JOV_, , , . 


25 


85 


gi 


1 /oo /-^by 1 rex | NP oiuizo . i | ir^— gxycoprorem xamixy uieiuxjex \ . . . 


9 S 
z. ^ 


R R 


qi 


z!U404il / / 1 go I /i/iLYlZ ZXyo.X|H.£!OUXZ / / X odX / kJdltNM.L- xt;L.uxxi iicdv. . . 


25 


85 


gi 
gi 
gi 


zlJJ4Uolpii| |o/UDb4i lecLin neavy cnam , in acetyiydxdCLUb. . . 
1 135 y / lb 1 pir 1 1 i4b /Z.Z conservea nyporneLicai prorem [imp. . . 
23510154 1 rex | NP /Uzoz-U . 1 | KNA-Dinamg prorem ox pumiiio. . . 


9 S 
z, -J 

9 ^ 

9 R 
z o 


ft R 

ft R 

ft R 
o o 


gi 


15450391 ! gb 1 AAKyD4oy . 1 1 Atzgl booU/ b ipid . y [Araoiaopsis r... 


9 R 
z o 


ft R 


gi 


204541 /9 1 gb | AAMzziyy . 1 I Ah ouiz / o i bar /ua IN AC lecrm neav. . . 


9 R 
z o 


ft R 


gi 


6322 4 2 8 1 ref 1 NP OlzbOz.ll Protein requirea tor cell viaoi... 


9 R 
Z 3 


P R 


gi 


1 4 z 5 u U / I 1 gb 1 AAHU o 4 4 U . 1 ] AAnU o 4 4i u o iiuiiar r o L-bi z y prorex . , . 


9 R 


ft R 


gi 


o9534zl i emb | CABybo /z . 1 j AiupniOli prorem i_ tsrancniosroma . . . 


9 R 
z o 


ft R 

O 3 


gi 


3yyo4U 1 spl r ~j ZU Z Z \ LjIIjI iiiNlnX bAljAb IUoXj UNnXoX i/idIjEj XjUjO i x in . . . 


9 R 


ft R 


gi 


1429059 / 1 gb 1 AAHU 90 / / . 1 | AAnuyu / / uLri-zy prorem Lriorno sap . . . 


9 R 
z o 


ft R 


gi 


7705724 1 ref 1 NP 057041.11 CGI-29 protein [Homo sapiens] >. . . 


"25 


85 


gi 


1063o4ol 1 einb | CAulz b4 z, . 1 | aJ i / ytii u . z ^ bimiiar ro uux zy p... 


9 R 

Z, J 


ft R 


gi 


/4ybb4z|pxr| | iiyo4U nypornericai prorem k^zouh . ± ~ L-aeno. . . 


9 R 


ft R 

O J 


gi 


17569651 1 rex 1 NP 510121. 1| Nuclear receptor-liKe , contain... 


9 R 
Z O 


X 1 4 


gi 


7513590 1 pir 1 1 T4223U AF4 protein - mouse (tragmenrj >gi | z . . , 


9 R 
Z O 


X X 4 


gi 


7661970 1 ref i NP 055455.11 KlAAUl /3 gene product LHomo sap... 


9 R 
Z D 


11/1 
1 X 4i 


gi 
gi 


159019 1 gb 1 AAA29126 . 1 i alpha-phosophoinosit ide-specif ic p... 
17 5 6397 0 1 ref 1 NP 50d9o/.1| Zmc tmger L4 type tamiiy mem... 


9 R 
Z D 

9 R 

z o 


1 1 /t 
X X 4 

1 1 Zl 
X X 4 


gi 


1254 93 i sp 1 P07313 1 KML2 RABIT Myosin light chain kinase 2,... 


9 c, 
Z 3 


114 


qi 


13359167 i abi i BAB3331b . 1 1 KlAAlb4b protein [homo sapiensj 


9 R 
Z O 


1 1 

1 1 fi 


gi 


17566198 IreflNP oU/b4y.ll Preaict ea bUb , zmc xmger b4 ... 


9 R 
Z O 


114 

X 1 4 


gi 


65 /z33U 1 emb j LABbzy / / . 1 | oAoyhiio . z t novel prorem similar . . . 


9 R 
Z J 


114 

X X 4 


gi 


20oyol46irex|XP lz/blo.i| similar ro nypornericai prorei , . . 


9 R 
Z O 


114 
X X f± 


qi 


/ 6 62 55 0 1 rei |NP Ua4oUi.i| rKUUoy / prorem [riomo sapiensj . . . 


9 R 

Z 3 


114 

X X 4 


qi 


zD023y4D|retlAP zu4iy/.il nomoiog o x numan iyixiXj i z uniuenr . . . 


9 R 


114 

X X 4 


gi 


Iz54y4 1 spj Pzubcsy 1 KMLz KAi Myos m iignr cnam Kinase z , s . . . 


9 R 


114 

X X 4 


gi 


20o4yyob|rex|AP 13oU4u.i| similar ro rioosomai prorem o . . . 


9 R 
Z 3 


114 

X X 4 


gi 


21553101 i rex 1 NP bbOlzo.ll atamin precursor, aipna aioumi , . . 


9 R 
Z O 


114 
X X f± 


gi 


14 993776 1 ref ! NP 149109.11 skeletal myosin lignt cnam Ki . . . 


9 R 
Z O 


11/1 
114 


gi 


17 105364 1 ref i NP 476557.1! myosin light chain kinase 2, s... 


Z D 


11/1 
114 


gi 


20072386 [ gb 1 AAH26681 . 1 1 Unknown (protein tor IMAbb : 3Ub4 4 . . . 


Z. D 


11/1 
114 


gi 


20336726 ! ref 1 NP 073603.2] ceramide kinase; lipid kinase ... 


z,o 


11/1 
114 


gi 


13878317 1 sp ! 088573 j AF4 MOUSE AF-4 protein (Proto-oncogen, . . 


z,o 


11/1 
114 


gi 


3930 63 1 gb i AAA182 96 . 1 1 hemagglutinin-neuraminidase >gi|39,.. 


O /I 

z 4 


loo 


qi 


1 62 1 4 17 1 gb ! AAB17 18 1 , 1 1 hemagglutinin-neuraminidase [Huma... 


Z 4 


1 R 

loo 


gi 


3930 67 1 gb 1 AAA03318 , 1 I hemagglutinin-neuraminidase 


9 /I 
Z 4 


1 R7 
1 O o 


gi 


X b ^ 1 o y / i go 1 AAiD 1 / X / X . X 1 nemaggx ur iriiri ric: ux diiiiiiiods e [ n uiiia . . . 


9 d 

Z. 


1 R^ 

X -J o 


gi 


393053 I qb I AAA18291 . 1 | hemagglutinin-neuraminidase 


~24 


153 


gi 


17544200 1 ref 1 NP 502862.11 Putative secreted or extracell... 


24 


153 


gi 


25155182 j ref i NP 741645. If Nuclear Hormone Receptor famil... 


24 


153 


gi 


1621421 1 qb | AAB17183 . 1 | hemagglutinin-neuraminidase [Huma. . . 


24 


153 


gi 


393047 1 gb | AAA18288 . 1 I hemagglutinin-neuraminidase 


24 


153 


gi 


t 13878219 1 ref 1 NP 113566.11 ribonuclease/angiogenin inhibi . . . 


24 


153 
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393059|gb|AAA18294 . 1 



1621409|gb|AAB17177.1 



1621413|qb|AAB17179.1| 



1621405igblAAB17175.1 | 



1621419[gb|AAB17182. 1 | 



729836 I sp I P41323 | IL6 



9629903|ref |NP 045936.11 ORF-C 



393051 |gb|AAA18290. 1 | 



13027596|ref |NP 



15231494 Iref [NP 189706.1 



192 63 653 j gb | AAH2 5255. 1 | 



22507470|gb|AAH19431 



12383068 Iref INP 073738 



9247084 I gbl AAF86275. 1 



19921116jref INP 



393049 I gb I AAA18289. 1 



16214151gb|AAB17180.1 | 



1621399|gb|AAB17172.1i 



hemagglutinin-neuraminidase 
hemagglutinin-neuraminidase [Huma . 
hemagglutinin-neuraminidase [Huma . 
hemagglutinin-neuraminidase [Huma . 
hemagglutinin-neuraminidase [Huma . 
CAN FA Interleukin-6 precursor (IL-6, 
[Walleye dermal sarcoma v. 
hemagglutinin-neuraminidase 
076421 . 1 I hypothetical protein MGC2487 [. 

hypothetical protein; protein . 
Similar to hypothetical protein . 
1 I Similar to hypothetical protein . 
1 1 hypothetical protein FLJ21347 . 
interleukin-6 [Canis familiaris] 
609448.11 CG12299-PA [Drosophila melanog. 

neuraminidase 
neuraminidase [Huma. 
neuraminidase [Huma. 



393069|gb|AAA03319.H 



hemagglutinin 
hemagglutinin 
hemagglutinin 
hemagglutinin-neuraminidase 
hemagglutinin-neuraminidase [Huma 
16588867 |gb|AAL26923. 1 IAF328929 1 interleukin-6 [Canis f 
10198059 I gb I AAG15164 . 1 | nuclear receptor NHR-63 [Caenorh 
23507986iref INP 700656.11 hypothetical protein [Plasmodi 

neuraminidase [Huma 
[Huma 



1621411|gb|AA.B17178.1 | 



1621403 |gb|AAB17174 . 1 



1621407|qb|AAB17176.1| 



hemagglutinin- 
hemagglutinin-neuraminidase 



24 


153 


24 


153 


24 


153 


24 


153 


24 


153 


24 


153 


24 


153 


24 


153 


24 


153 


24 


153 
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24 


153 


24 


153 


24 


153 


24 


153 


24 


153 



Alignments 

Get selected sequences 
Select all 
Deselect all 



> gi I 7519866 I pir I IA58997 kappa-conotoxin PVIIA - cone shell (Conus purpurascens ) 
Length = 27 

Score = 98.2 bits (224), Expect = 9e-21 

Identities = 27/27 (100%), Positives = 27/27 (100%) 

Query: 1 CRIPNQKCFQHLDDCCSRKCNRFNKCV 27 

CRIPNQKCFQHLDDCCSRKCNRFNKCV 
Sbjct: 1 CRIPNQKCFQHLDDCCSRKCNRFNKCV 27 



> gi I 22002036 I sp { P5 6 633 | CXK7 CONPU Kappa-conotoxin PVIIA precursor (Fin-popping peptide) 
Length = 72 

Score - 98.2 bits (224), Expect = 9e-21 

Identities = 27/27 (100%), Positives = 27/27 (100%) 

Query: 1 CRIPNQKCFQHLDDCCSRKCNRFNKCV 27 

CRIPNQKCFQHLDDCCSRKCNRFNKCV 
Sbjct: 4 6 CRIPNQKCFQHLDDCCSRKCNRFNKCV 72 



> gi I 38 918 72 | pdb j IKCPj 3d Structure Of K-Conotoxin Pviia, A Novel Potassium 
Channel-Blocking Toxin From Cone Snails, Nmr, 22 
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structures 
Length = 28 

Score = 92.7 bits (211), Expect = 4e-19 
Identities = 26/27 (96%), Positives = 26/27 (96%) 

Query: 1 CRIPNQKCFQHLDDCCSRKCNRFNKCV 27 

CRI NQKCFQHLDDCCSRKCNRFNKCV 
Sbjct: 1 CRIXNQKCFQHLDDCCSRKCNRFNKCV 27 



> gi I 38 91964 | pdb I 1AV3 | Potassium Channel Blocker Kappa Conotoxin Pviia From C. 
Purpurascens , Nmr, 20 Structures 
Length = 27 

Score - 92.7 bits (211), Expect = 4e-19 
Identities = 26/27 (96%), Positives =26/27 (96%) 

Query: 1 CRIPNQKCFQHLDDCCSRKCNRFNKCV 27 

CRI NQKCFQHLDDCCSRKCNRFNKCV 
Sbjct: 1 CRIXNQKCFQHLDDCCSRKCNRFNKCV 27 



> gi I 1905 981 I gb | AAB5022 9. 1 | Gal/GalNAc lectin heavy subunit [Entamoeba dispar] 
Length = 1125 

Score = 33.3 bits (71), Expect - 0.32 

Identities = 12/17 (70%), Positives = 12/17 (70%), Gaps = 3/17 (17%) 

Query: 13 DDCCSRK--CNRFN-KC 26 

DDCCSRK C FN KC 
Sbjct: 700 DDCCSRKSQCGNFNGKC 716 



> qi [ 17158399 [ ref I NP 477819.11 wsv297 [shrimp white spot syndrome virus] 

gi|17016693| gbj AAL332 99. 1 [ wsv297 [shrimp white spot syndrome virus] 

gi I 19481945 I gb | AAL89221 . 1 | WSSV353 [shrimp white spot syndrome virus] 
Length - 69 

Score =30.3 bits (64), Expect =2.5 

Identities = 9/14 (64%), Positives = 10/14 (71%), Gaps = 3/14 (21%) 

Query: 12 LDDCCSRKCNRFNK 25 

LD+CC CNRF K 
Sbjct: 2 9 LDECC CNRFQK 39 



> gi I 15895618 1 ref |NP 348967.11 Glycosyltransf erase [Clostridium acetobutylicum] 
gi [ 15025361 i gb j AAK80307 . 1 | AEQ077 35 10 Glycosyltransf erase [Clostridium acetobutylicum] 
Length = 252 

Score =28.6 bits (60), Expect =8.1 

Identities - 11/24 (45%), Positives = 13/24 (54%), Gaps = 11/24 (45%) 

Query: 8 CFQHLDDCCSRK CNRFN 24 

CF + + DCC K CNRFN 
Sbjct: 193 CF--IEDCC--KQGIKIYSCNRFN 212 
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> gi I1Q12Q5Q3I pdb | 1C6W | A Chain A, Maurocalcin From Scorpio Maurus 
Length = 33 

Score - 28.2 bits (59), Expect = 11 

Identities = 10/21 (47%), Positives = 11/21 (52%), Gaps - 6/21 (28%) 

Query: 8 CFQHLD DCCSRKCNR 22 

C HL DCCS-fKC R 

Sbjct: 3 CLPHLKLCKENKDCCSKKCKR 2 3 



> gi I 19577387 I emb | CAD2 77 69 . 1 | hypothetical protein [Anopheles gambiae] 
Length = 257 

Score =28.2 bits (59), Expect - 11 
Identities == 7/8 (87%), Positives = 8/8 (100%) 

Query: 15 CCSRKCNR 22 

CCSRKC+R 
Sbjct: 41 CCSRKCSR 4 8 



> gi |214Q2496|ref|NP 658481. Ij minC, Bacterial proteins involved in chromosomal 
partioning 

[Bacillus anthracis A2012] 
Length = 228 

Score =27.8 bits (58), Expect = 15 
Identities = 1/1 (100%), Positives = 1/1 (100%) 

Query: 11 HLDDCCS 17 

HLDDCCS 
Sbjct: 21 HLDDCCS 27 



> qi i 23021387 I gb I ZP 00061064.11 hypothetical protein [Clostridium thermocellum ATCC 
27405] 

Length = 143 

Score =27.8 bits (58), Expect = 15 

Identities = 9/18 (50%), Positives - 11/18 (61%), Gaps = 4/18 (22%) 

Query: 14 DC CSRKCNRFNKCV 27 

DC S KC+RF+ CV 

Sbjct: 91 DCVVEDSSPKCDRFDNCV 108 



> gi I 17567271 I ref I NP 510129.11 Predicted CDS, p-glycoprotein family member 
[Caenorhabditis 

elegans ] 

gi j 7499666 Ipir I IT21269 hypothetical protein F22E10.4 - Caenorhabditis elegans 
gi j 387 62 94 I emb | CAA91802 . 1 | C. elegans pgp-15 protein (corresponding sequence F22E10.4) 
[Caenorhabditis elegans] 
Length = 1270 



Score - 27.8 bits (58), Expect = 15 

Identities = 8/11 (72%), Positives = 8/11 (72%), Gaps = 3/11 (27%) 

Query: 8 CFQHLDDCCSR 18 

CFQH CCSR 
Sbjct: 165 CFQH CCSR 172 



> gi I 204 5417 5 I gb jAAM22197 . 1 | AF50127 6 1 Gal/GalNAc lectin heavy subunit region D 
[Entamoeba histolytica] 
Length = 372 

Score =27.4 bits (57), Expect = 19 

Identities = 12/18 (66%), Positives = 12/18 (66%), Gaps = 3/18 (16%) 

Query: 13 DDCCSRK--CNRFN-KCV 27 

DDC SRK C FN KCV 
Sbjct: 92 DDCNSRKSQCGNFNGKCV 109 



> gi I 74 94127 I pir I IT09229 galactose binding adhesin heavy chain - Entamoeba histolytica 
gi I 2 90 64 9 I gb | AAA18 8 28 . 1 | galactose-specif ic adhesin 170kD subunit 
Length = 1292 

Score =27.4 bits (57), Expect = 19 

Identities - 12/18 (66%), Positives = 12/18 (66%), Gaps = 3/18 (16%) 

Query: 13 DDCCSRK--CNRFN-KCV 27 

DDC SRK C FN KCV 
Sbjct: 730 DDCNSRKSQCGNFNGKCV 747 



> gi 122476954 1 gb | AAM97 358 . 1 | Gal/GalNAc lectin heavy subunit region D [Entamoeba 
histolytica] 

Length = 372 

Score =27.4 bits (57), Expect = 19 

Identities = 12/18 (66%), Positives - 12/18 (66%), Gaps = 3/18 (16%) 

Query: 13 DDCCSRK — CNRFN-KCV 27 

DDC SRK C FN KCV 
Sbjct: 92 DDCNSRKSQCGNFNGKCV 109 



> gi I 2133404 I pir [ IS70663 lectin heavy chain, N-acetylgalactosamine-specif ic - Entamoeba 
histolytica (fragment) 
gi I 9930 53 1 gb | AAA7 5551 . 1 | GalNAc lectin heavy subunit 
Length = 194 

Score - 27.4 bits (57), Expect = 19 

Identities = 12/18 (66%), Positives - 12/18 (66%), Gaps = 3/18 (16%) 



Query: 13 DDCCSRK — CNRFN-KCV 27 

DDC SRK C FN KCV 
Sbjct: 131 DDCNSRKSQCGNFNGKCV 148 
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> gii22964283|qb[ZP 00011889.11 hypothetical protein [Rhodopseudomonas palustris] 
Length - 559 

Score =27.4 bits (57), Expect = 19 

Identities = 9/12 (75%), Positives = 10/12 (83%), Gaps = 1/12 (8%) 

Query: 8 CFQHLDDCCSRK 19 

C+QHLDD SRK 
Sbjct: 4 9 CYQHLDD-ISRK 59 



> gi I 17563788 I ref I NP 507195.11 Predicted CDS, 7 transmembrane chemoreceptor family 
member family 

member [Caenorhabditis elegans] 
gi I 7506996 ipir | | T24389 hypothetical protein T03E6.2 - Caenorhabditis elegans 
qi I 387 9369 | emb | CABQ727 7 . 1 | Hypothetical protein T03E6.2 [Caenorhabditis elegans] 
Length - 213 

Score = 26.9 bits (56), Expect = 26 

Identities = 8/13 (61%), Positives = 9/13 (69%), Gaps - 3/13 (23%) 

Query: 1 CR IPNQKCFQ 10 

CR + NQKCFQ 
Sbjct: 55 CRTMLVSNQKCFQ 67 



Score = 20.2 bits (40), Expect = 2886 
Identities = 5/6 (83%), Positives = 6/6 (100%) 

Query: 5 NQKCFQ 10 

N+KCFQ 
Sbjct: 125 NRKCFQ 130 



> gi|21703938|ref INP 663450.11 ceramide kinase [Mus musculus] 
gi I 2162434 2 I dbj IBAC01155. 1 | ceramide kinases [Mus musculus] 
Length - 531 

Score = 26.9 bits (56), Expect = 26 

Identities = 11/21 (52%), Positives - 12/21 (57%), Gaps = 5/21 (23%) 

Query: 9 FQHLDDCCS RKCNRFN 2 4 

F HL D S RKC+RFN 
Sbjct: 410 FAHLGDGSSDLILIRKCSRFN 430 



> gii543793|sp|P36953[AFAM RAT Afamin precursor (Alpha-albumin) (Alpha-Alb) 
gi [ 1363266 I pir I IA53195 afamin precursor - rat 

gi I 45635 9 I emb | CAA53 994 . 1 | unnamed protein product [Rattus norvegicus] 
Length = 608 

Score = 26.9 bits (56), Expect = 26 

Identities = 10/22 (45%), Positives = 12/22 (54%), Gaps - 9/22 (40%) 
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Query: 12 LDDCCSRK CNR FN 24 

L+DCC+RK C R FN 
Sbjct: 381 LEDCCNRKNPLSCYRHAEDKFN 4 02 



Score = 18.5 bits (36), Expect = 9354 
Identities - 4/5 (80%), Positives = 5/5 (100%) 

Query: 8 CFQHL 12 
CF+HL 

Sbjct: 509 CFEHL 513 



> gi I 23510111 I ref I NP 702777.11 erythrocyte membrane protein 1 (PfEMPl) [Plasmodium 
falciparum 3D7] 

gi I 6562758 I emb I CAB62897 . 1 j erythrocyte membrane protein 1 (PfEMPl) [Plasmodium 
falciparum 3D7] 

Length = 2209 

Score = 26.5 bits (55), Expect = 35 

Identities = 12/24 (50%), Positives = 15/24 (62%), Gaps = 8/24 (33%) 

Query: 9 FQHLDDC--CSR KC-NRFNKC 2 6 

FQH +DC CS+ KC N +KC 
Sbjct: 1356 FQHTEDCKSCSKFRIKCDN--DKC 1377 



> gi I 17508479 I ref I NP 493155.11 Nuclear Hormone Receptor family NHR-89 (nhr-89) 
[Caenorhabditis 

elegans ] 

gi I 7498355 Ipir I | T20208 hypothetical protein E03H4.13 - Caenorhabditis elegans 

gi I 3875219 1 emb I CABO 5470. 1 I C. elegans nhr-89 protein (corresponding sequence E03H4.13) 

[Caenorhabditis elegans] 
gi 138754981 emb | CABO 4 035 . 1 | C. elegans nhr-89 protein (corresponding sequence E03H4.13) 

[Caenorhabditis elegans] 

Length = 310 

Score - 26.5 bits (55), Expect = 35 

Identities = 11/20 (55%), Positives = 12/20 (60%), Gaps - 7/20 (35%) 

Query: 12 LDD CCSRKCNRFNKCV 27 

LDD C R C R+NKCV 
Sbjct: 52 LDDQKQFC--RSC-RYNKCV 68 



> gi I 17569649 I ref jNP 510122. Ij Predicted CDS, zinc finger C4 type Ligand-binding domain 
of 

nuclear hormone receptor like family member 

[Caenorhabditis elegans] 
gi I 7507363 Ipir 1 IT2465Q hypothetical protein T07C5 . 2 - Caenorhabditis elegans 
gi I 387 9537 | emb | CAA9Q2 99 . 1| Hypothetical protein T07C5.2 [Caenorhabditis elegans] 

Length - 334 

Score = 26.1 bits (54), Expect = 47 

Identities = 12/26 (46%), Positives = 13/26 (50%), Gaps = 12/26 (46%) 



Query: 2 RIPNQKCFQHLDDCCSRKCNRFNKCV 2 7 
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R+P KC R C RFNKCV 

Sbjct: 59 RLP--KC RHC-RFNKCV 72 



> qi I 23612818 I ref I NP 704357.11 hypothetical protein [Plasmodium falciparum 3D7] 
gi 1234990961 emb | CAD5117 6 . 1 I hypothetical protein [Plasmodium falciparum 3D7] 
Length = 1408 

Score = 26.1 bits (54), Expect - 47 

Identities - 11/20 (55%), Positives = 12/20 (60%), Gaps = 8/20 (40%) 

Query: 6 QKCFQHLDDCCSRKCNRFNK 25 

+KCF D SRKCN NK 
Sbjct: 1008 EKCF D SRKCN--NK 1019 



> qi I 23613070 I ref I NP 703392.11 hypothetical protein [Plasmodium falciparum 3D7] 
qi I 23504 532 I emb | CAD51412 . 1 | hypothetical protein [Plasmodium falciparum 3D7] 
Length = 2349 

Score =25.7 bits (53), Expect = 63 

Identities = 9/14 (64%), Positives = 11/14 (78%), Gaps = 1/14 (7%) 

Query: 12 LDDCCSRKCN-RFN 24 

LDD C+ KCN +FN 
Sbjct: 1975 LDDICNNKCNDKFN 1988 



Score =17.6 bits (34), Expect = 16841 
Identities = 5/6 (83%), Positives - 5/6 (83%) 

Query: 19 KCNRFN 24 

K NRFN 
Sbjct: 359 KINRFN 364 



> gi I 23123030 I gb I ZP 00105165.11 hypothetical protein [ Prochlorococcus marinus subsp. 
pastoris str. 

CCMP1378] 
Length - 284 

Score =25.7 bits (53), Expect = 63 
Identities = 7/10 (70%), Positives = 9/10 (90%) 

Query: 10 QHLDDCCSRK 19 

+HL +CCSRK 
Sbjct: 211 EHLINCCSRK 220 



> gi I 21302332 I gb|EAA144 77. 1 | agCP8528 [Anopheles gambiae str. PEST] 
Length - 851 

Score = 25.7 bits (53), Expect = 63 

Identities = 8/15 (53%), Positives = 10/15 (66%), Gaps = 3/15 (20%) 
Query: 16 CSRKCN RFNKCV 27 
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C RKC+ RF +CV 
Sbjct: 49 CRRKCDRACRFGRCV 63 



> gi I 21288202 I gb I EAAQ0523. 1 | agCP9342 [Anopheles gambiae str. PEST] 
Length = 382 

Score = 25.7 bits (53), Expect = 63 

Identities = 12/32 (37%), Positives = 17/32 (53%), Gaps = 12/32 (37%) 

Query: 1 CRIPN--QKC FQHLDDCCSRKCNRFNKC 26 

C +P+ +KC F H DDC ++F KG 

Sbjct: 160 CPVPDTCEKCDPTFFPH-DDC DKFYKC 185 



> gi I 17538866 I ref [ NP 501730.11 Zinc finger C4 type Ligand-binding domain of nuclear 
hormone 

receptor like family member [Gaenorhabditis elegans] 
gi I 15718178 I emb | CAC7 0Q72 . 1 | Hypothetical protein G28D4.9 [Gaenorhabditis elegans] 
Length - 420 

Score = 25.2 bits (52), Expect = 85 

Identities = 11/24 (45%), Positives = 12/24 (50%), Gaps - 11/24 (45%) 

Query: 13 DDC CSRKGNRFNKCV 27 

+DG G R G RFNKGV 

Sbjct: 54 EDCEVLQQFRNRC-RAC-RFNKGV 75 



> gi I 17 068 4 27 I gb j AAH17 5 94 . 1 | AAH17 594 Unknown (protein for MGC:714) [Homo sapiens] 
Length = 242 

Score = 25.2 bits (52), Expect = 85 
Identities - 6/7 (85%), Positives - 7/7 (100%) 

Query: 14 DCCSRKC 20 

DCGSR+C 
Sbjct: 10 DCGSRRG 16 



> gi i 17567269 I ref INP 510128.11 P-glycoprotein family member (14 6.9 kD) [Gaenorhabditis 
elegans] 

gi I 7499665 jpir j [ T21268 hypothetical protein F22E10.3 - Gaenorhabditis elegans 
gi i 387 62 93 I emb | CAA918 01 . 1 | C. elegans pgp-14 protein (corresponding sequence F22E10.3) 
[Gaenorhabditis elegans] 
Length = 1327 

Score = 25.2 bits (52), Expect = 85 

Identities = 7/11 (63%), Positives = 8/11 (72%), Gaps = 3/11 (27%) 

Query: 8 GFQHLDDGGSR 18 

GFQH CG+R 
Sbjct: 171 GFQH GCTR 178 
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> gi I 204 54 17 7 I gb | AAM22198 . 1 | AF501277 1 Gal/GalNAc lectin heavy subunit region D 
[Entamoeba histolytica] 
Length = 372 

Score - 25.2 bits (52), Expect = 85 

Identities = 11/18 (61%), Positives - 12/18 (66%), Gaps = 3/18 (16%) 

Query: 13 DDCCSRK--CNRFN-KCV 27 

DDC SRK C FN KC+ 
Sbjct: 92 DDCNSRKSQCGNFNGKCI 109 



> gi I 2133405 I pir I | S70664 lectin heavy chain, N-acetylgalactosamine-specif ic - Entamoeba 
histolytica (fragment) 
gi i 993055 [ qb| AAA7 5552 . 1 | GalNAc lectin heavy subunit 
Length - 194 

Score = 25.2 bits (52), Expect = 85 

Identities - 11/18 (61%), Positives == 12/18 (66%), Gaps - 3/18 (16%) 

Query: 13 DDCCSRK — CNRFN-KCV 27 

DDC SRK C FN KC+ 
Sbjct: 131 DDCNSRKSQCGNFNGKCI 14 8 



> gi I 11359716 I pir i IT46722 conserved hypothetical protein [imported] - Leishmania major 

gi 160664451 emb | CAB58387 . 1 j possible proline synthetase associated protein [Leishmania 
maj or] 

Length - 389 

Score = 25.2 bits (52), Expect = 85 
Identities = 6/7 (85%), Positives - 7/7 (100%) 

Query: 14 DCCSRKC 20 

DCCSR+C 
Sbjct: 8 9 DCCSRRC 95 



> gi I 23510154 I ref I NP 702820.11 RNA-binding protein of pumilio/mptS family, putative 
[ Plasmodium 

falciparum 307] 

gi i 234 98236 I emb | CAD4 92 07 . 1 j RNA-binding protein of pumilio/mpt5 family, putative 
[Plasmodium 

falciparum 3D7] 

gi I 24 8 4 9851 i qb j AAM4 4 411 . 1 | RNA-binding protein Puf2 [Plasmodium falciparum] 
Length = 514 

Score =25.2 bits (52), Expect - 85 
Identities = 7/9 (77%), Positives = 8/9 (88%) 

Query: 6 QKCFQHLDD 14 

QKCF+H DD 
Sbjct: 392 QKCFEHSDD 400 



> gi 1 154 50391 | gb j AAK964 8 9 . 1 | At2gl6530/FlP15 . 9 [Arabidopsis thaliana] 
Length - 343 
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Score = 25.2 bits (52), Expect = 85 

Identities = 8/10 (80%), Positives = 8/10 (80%), Gaps = 1/10 (10%) 

Query: 3 IPNQKCFQHL 12 

IP QKCF HL 
Sbjct: 61 IP-QKCFAHL 69 



> gi|204 5417 9|gb|AAM22199.1 IAF5Q1278 1 Gal/GalNAc lectin heavy subunit region D 
[Entamoeba histolytica] 
Length = 372 

Score = 25.2 bits (52), Expect - 85 

Identities = 11/18 (61%), Positives = 12/18 (66%), Gaps = 3/18 (16%) 

Query: 13 DDCCSRK — CNRFN-KCV 27 

DDC SRK C FN KC+ 
Sbjct: 92 DDCNSRKSQCGNFNGKCI 10 9 



> gi i 6322428 I refjNP 012502.11 Protein required for cell viability; Yjl032wp 
[ Saccharomyces 

cerevisiae ] 

gi I 1352969 I sp I P47059 I YJD2 YEAST HYPOTHETICAL 12.4 KD PROTEIN IN DBP4-BET4 INTERGENIC 
REGION 

gi I 1077805 { pir I IS56804 hypothetical protein YJL032w - yeast (Saccharomyces cerevisiae) 
gi I 1008152 | emb | CAA8 9322 . 1 | ORF YJL032w [Saccharomyces cerevisiae] 
Length = 104 

Score = 25.2 bits (52), Expect = 85 

Identities = 7/12 (58%), Positives = 9/12 (75%), Gaps = 2/12 (16%) 

Query: 9 FQHLDDCCSRKC 20 

+ HL CCSR+C 
Sbjct: 6 YHHL--CCSRRC 15 



> gi i 14250071 I qb | 7VAH08 4 4 0 . 1 ! AAHOB 4 4 0 Similar to CGI-29 protein [Homo sapiens] 
Length = 242 

Score = 25.2 bits (52), Expect - 85 
Identities = 6/7 (85%), Positives = 7/7 (100%) 

Query: 14 DCCSRKC 2 0 

DCCSR+C 
Sbjct: 10 DCCSRRC 16 



> gi I 8 953421 1 emb| CAB9657 2 . 1 | AmphiGli protein [Branchiostoma floridae] 
Length = 1112 

Score =25.2 bits (52), Expect = 85 

Identities = 13/25 (52%), Positives - 13/25 (52%), Gaps = 8/25 (32%) 

Query: 6 QKCFQHLDDC--CS — RKCNRFNKC 2 6 
QKC C CS RKCNR KC 
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Sbjct: 1013 QKC SKCKRCSKCRKCNRCKKC 1033 



> gi[39954Q|sp|P32022|GILl ENTHI GALACTOSE- INHIBITABLE LECTIN 170 KD SUBUNIT 
Length = 1276 

Score = 25.2 bits (52), Expect = 85 

Identities = 11/18 (61%), Positives - 12/18 (66%), Gaps - 3/18 (16%) 

Query: 13 DDCCSRK — CNRFN-KCV 27 

DDC SRK C FN KC+ 
Sbjct: 714 DDCNSRKSQCGNFNGKCI 731 



> gi I 14 2 90597 | gb| AAH0 907 7 . H AAH0 90 7 7 CGI-29 protein [Homo sapiens] 
Length = 242 

Score = 25.2 bits (52), Expect = 85 
Identities = 6/7 (85%), Positives = 7/7 (100%) 

Query: 14 DCCSRKC 20 

DCCSR+C 
Sbjct: 10 DCCSRRC 16 



> gi|7705724iref |NP 057041.11 CGI-29 protein [Homo sapiens] 
gi I 4 68 0697 ! gb [ AAD27 7 38. 1 | AF132 9 63 1 CGI-29 protein [Homo sapiens] 
Length = 242 

Score = 25.2 bits (52), Expect = 85 
Identities - 6/7 (85%), Positives = 1/1 (100%) 

Query: 14 DCCSRKC 20 

DCCSR+C 
Sbjct: 10 DCCSRRC 16 



> gi I 106384 81 |emb|CAC12642. 1 | dJ179L10.2 (Similar to CGI-29 protein) [Homo sapiens] 
Length = 153 

Score =25.2 bits (52), Expect = 85 
Identities = 6/7 (85%), Positives = 7/7 (100%) 

Query: 14 DCCSRKC 2 0 

DCCSR+C 
Sbjct: 10 DCCSRRC 16 



> gi I 7496642 Ipirl IT19540 hypothetical protein C28D4.1 - Caenorhabditis elegans 
Length = 874 

Score =25.2 bits (52), Expect = 85 

Identities = 11/24 (45%), Positives = 12/24 (50%), Gaps = 11/24 (45%) 

Query: 13 DDC CSRKCNRFNKCV 27 

+DC C R C RFNKCV 
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Sbjct: 96 EDCEVLQQFRNRC-RAC-RFNKCV 117 



> gi I 17569651 I ref I NP 510121.11 Nuclear receptor-like, contains ligand-binding domain but 
no 

obvious zinc finger family member [Caenorhabditis 
elegans ] 

gi i 7507364 I pir I IT24648 hypothetical protein T07C5.3 - Caenorhabditis elegans 
qi I 3879535 | emb | CAA9Q297 . 1 | Hypothetical protein T07C5.3 [Caenorhabditis elegans] 
Length = 319 

Score =24.8 bits (51), Expect = 114 

Identities = 8/10 (80%), Positives = 8/10 (80%), Gaps = 1/10 (10%) 

Query: 18 RKCNRFNKCV 27 

R C RFNKCV 
Sbjct: 4 9 RHC-RFNKCV 57 



> gi I 7513590 Ipir I |T42230 AF4 protein - mouse (fragment) 
qi 125820191 gb | AAB82427 . 1 | mAF4 [Mus musculus] 
Length = 1211 

Score =24.8 bits (51), Expect = 114 
Identities = 6/7 (85%), Positives = 7/7 (100%) 

Query: 16 CSRKCNR 22 

CSRKC+R 
Sbjct: 158 CSRKCDR 164 



> gi I 7661970 I ref i NP 055455.11 KIAA0173 gene product [Homo sapiens] 
qi|204553771spiQ14679|Y173 HUMAN Hypothetical protein KIAA0173 

gi 1 11364061 dbji BAAl 14 90.11 similar to pig tubulin-tyrosine ligase. [Homo sapiens] 
Length = 1199 

Score =24.8 bits (51), Expect = 114 
Identities - 6/7 (85%), Positives = 7/7 (100%) 

Query: 12 LDDCCSR 18 

L+DCCSR 
Sbjct: 519 LEDCCSR 525 



> gi i 15 9019 I gb I AAA2 912 6 . 1 | alpha-phosophoinositide-specif ic phospholipase C-like protein 
precursor [Euplotes crassus] 
gi i 22 83 93 |prf | | 18034 2 9A phospholipase C-like protein 
Length =141 

Score = 24.8 bits (51), Expect = 114 

Identities = 10/16 (62%), Positives = 11/16 (68%), Gaps = 5/16 (31%) 

Query: 12 LDDCCSR-K — CNRFN 24 

LD CSR K C+RFN 
Sbjct: 7 8 LD— CSRYKPVCDRFN 91 
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> gill7563970|re£lNP 506967, 1| Zinc finger C4 type family member (47.8 kD) 
[Caenorhabditis 

elegans] 

gi| 7507276|pir [ IT24563 hypothetical protein T06C12.6 - Caenorhabditis elegans 
gi I 38 7 9608 [ emb | CAB032 98 . 1 | Hypothetical protein T06C12.6 [Caenorhabditis elegans] 
Length =414 

Score = 24.8 bits (51), Expect = 114 

Identities = 11/24 (45%), Positives = 12/24 (50%), Gaps = 11/24 (45%) 

Query: 13 DDC CSRKCNRFNKCV 27 

D+C C R C RFNKCV 

Sbjct: 67 DNCAVSSTDRYQC-RLC-RFl^KCV 88 



Get selected sequences 
Select all 
Deselect all 



Database: All non-redundant GenBank CDS 
trans la tions+PDB+SwissProt+PIR+PRF 

Posted date: Nov 25, 2002 1:38 AM 
Number of letters in database: 395,571,179 
Number of sequences in database: 1,242,7 68 

Lambda K H 

0.355 0.288 2.12 



Gapped 

Lambda K H 

0.294 0.110 0.610 



Matrix: PAM30 

Gap Penalties: Existence: 9, Extension: 1 

Number of Hits to DB : 21,155,857 

Number of Sequences: 1242768 

Number of extensions: 470271 

Number of successful extensions: 11171 

Number of sequences better than 20000.0: 10987 

Number of HSP's better than 20000.0 without gapping: 10987 

Number of HSP's successfully gapped in prelim test: 0 

Number of HSP's that attempted gapping in prelim test: 0 

Number of HSP's gapped (non-prelim): 11165 

length of query: 27 

length of database: 395,571,179 

effective HSP length: 18 

effective length of query: 9 

effective length of database: 373,201,355 

effective search space: 3358812195 

effective search space used: 3358812195 

T: 11 

A: 40 

XI: 14 ( 7.2 bits) 
X2: 35 (14.8 bits) 
X3: 58 (24 . 6 bits) 
SI: 34 (19.2 bits) 



(17 . 6 bits) 



Pat Native PVIIA Search 



BLASTP 2.2.4 [Aug-26-2002] 



Reference : 



Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search 
programs", Nucleic Acids Res. 25:338 9-34 02. 



RID: 1038287801-011156-13490 



Query= 



(27 letters) 



Database: Protein sequences derived from the Patent division of 
GenBank 

104,350 sequences; 16,453,681 total letters 



If you have any problems or questions with the results of this search 
please refer to the BLAST FAQs 



Taxonomy reports 



Distribution of 191 Blast Hits on the Query Sequence 



Sequences producing significant alignments: 
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E 
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gi 
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0. 


76 


gi 
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0. 


76 


gi 
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3 


gi 
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3 


gi 
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gi 
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gi 
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25 


4 . 


4 


gi 
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gi 


14035826 i emb j CAC38509 . 1 j unnamed protein product [Homo s... 


2 1 


47 


gi 


5 92200 [ gb 1 AAA5 3807 . 1 i Sequence 4 from Patent WO 8909834 


2 1 


47 


gi 


3210124 1 gb 1 AAC21068 . 1 1 181827 Sequence 5 from patent US 5... 


21 


4 7 


gi 


1 uu b / 4 yj 1 go j AAEb4 b y y . 1 1 sequence i4 rrom patent ub oy/ foVD 


9 1 
Z 1 


/I 7 
4 / 


gi 


3 994 04 7'| gb j AAC874 75 . 1 1 AR0078 4 1 Sequence 4 from patent US... 


2 1 


4 7 


gi 


4 00008 1 1 gb ! AAC935 13 . 1 1 AR0232 15 Sequence 53 from patent U... 


21 


4 7 


gi 


z u y bbz z 1 go i AABb^ bo 4 . i i bequence z o rrom parenr Ub Dbzzoz... 


9 1 
Z X 


A 7 
4 / 


gi 


9123 66 1 gb I AAA7 1820 . 1 1 Sequence 25 from patent US 5429921... 


2 1 


4 7 


gi 


z Uz bo 1 / y 1 go 1 AAiYii / 4 1 b . 1 1 bequence 4y rrom parenr ub bj5biyD4 


9 1 
z 1 


/ 7 
4 / 


gi 


5 94 68 6 9 1 gb j AAE037 7 4 . 1 1 Sequence zb rrom patent US bobbb4... 


A T 

Z 1 


4 7 


gi 


b 9 4 bo 61 1 gb I AAE03 / bb . 1 1 Sequence / rrom patent US bobbb4z,.. 


A 1 
Z 1 


4 / 


gi 


zOybbzb 1 gb 1 AABb4bbb . 1 1 bequence z4 trom patent Ub bbzzoz... 


A 1 
Z 1 


A 1 

4 / 


gi 


90QC:R1RI^V\l7XnRR/1'3/17 1 1 Qciz-^nd-nr-^ci -F v/^m r^=l■^-dr^■^- FIQ ^^^99(59 

z u y b b 1 0 1 go j /^/ixso ^ o ^ / . i \ bequence id r rom parenr ub ddzzoz... 


9 1 
Z 1 


A 7 
4 / 


gi 


5946859 1 gb IAAE03764 . 1 i Sequence 4 from patent US 5866542... 


"21 


47 


gi 


20246635|gb| AAE95293.il Sequence 111 from patent US 6350933 


21 


63 


gi 


2489301|gbiAAB77437.1|I58938 Sequence 23 from patent US . . . 


21 


63 


gi 


14118421 1 gbjAAE59660 . 1 1 Sequence 90 from patent US 6187548 


21 


63 


gi 


2489300 1 gb IAAB77436. 1II58937 Sequence 10 from patent US ... 


21 


63 


gi 


5946881|gb|AAE03786.1| Sequence 47 from patent US 586654... 


21 


63 



3 



17912954|gb|AA£83208.1| 



212 607 7 4 i emb | CAD325 99. 1 | 
14112251 I gb I AAE57 668 . 1 [ 



11191607 I emb | CAC1634 4 . 1 | 
12807910|gb|AAE4313Q . 1 ! 



14118419|gb|AAE59658.1 | 
162 362 95 I gb! AAE7 8102 . 1 



Sequence 
Sequence 

Sequence 2 

141184 20 I gb I AAE5 965 9. 1 I Sequence 



270588 I gbj AAA02097 . 1 | 



3 99218 5 Igbl AAC85 612. 1 IARQ04 027 



1512 4 7 58 I gb I AAE73219. 1 | 
6733930 |emb|CAB69362. 1 | 



1018 4 354 i emb | CAC08 9Q0 . 1 | 
14118418 |gb|AAE59657 . 1 | 



218 9 9919 I emb [CAD4 2378 . 1 | 



183Q486|gb|AAB44940.1i 
15121888[gb[AAE72568 . 1 | 



1831302 Igbl AAB45756. 1 | 



1611873|gb|AAB14852 
592412 |gb|AAA54019 . 1 



II 



10063187 I gb I AAE3812 7 . 1 | 



Sequence 33 from patent US 6303295 
unnamed protein product [Drosop. . . 
Sequence 40 from patent US 6177075 
unnamed protein product [uniden. . . 
Sequence 2 from patent US 6074840 
86 from patent US 6187548 
9 from patent US 627098... 
from Patent US 4879374 
88 from patent US 6187548 
Sequence 2 from patent US... 
Sequence 84 from patent US 6239270 
unnamed protein product [unident... 

fusion between Aequorea victori... 
Sequence 84 from patent US 6187548 
unnamed protein product [Homo s... 
Sequence 29 from patent US 558745... 

Sequence 4 from patent 
Sequence 3 from patent 
Sequence 5 from patent 
Sequence 2 from Patent 



US 6236946 
US 5591821 
US 5541311. . 
WO 8905355 . . 



1831307 I gb I AAB45761 
15864475|emb[CAC88676 



214 38 922 I emb | CAD34 84 8 . 1 | 



Sequence 16 from patent US 5993827 
II Sequence 13 from patent US 5591821 
1| unnamed protein product [Homo s.. 
unnamed protein product [Homo s.., 



21 


63 


21 


63 


21 


63 


21 


63 


21 


63 


21 


63 


21 


63 


21 


63 


21 


63 


21 


63 


21 


63 


21 


63 


21 


63 


21 


63 


21 


63 


21 


84 


21 


84 


21 


84 


21 


84 


21 


84 


21 


84 


21 


84 


21 


84 


21 


84 



Alignments 

Get selected sequences 
Select all 
Deselect all 



> gi I 1410710 9 Igbl AAE5 5 9 63. 1 | 
gi I 14107110 Igbl AAE55 964 . 1 | 
gi I 141182 611 gbj AAE5 9500 . 1 | 
Length = 1292 



Sequence 5 from patent US 6165469 
Sequence 6 from patent US 6165469 
Sequence 5 from patent US 6187310 



Score = 27.4 bits (57), Expect = 0.76 

Identities = 12/18 (66%), Positives = 12/18 (66%), Gaps 



3/18 (16%) 



Query: 13 DDCCSRK--CNRFN-KCV 27 

DDC SRK C FN KCV 
Sbjct: 7 30 DDCNSRKSQCGNFNGKCV 74 7 



> gi I 141182 62 ! gb | AAE5 9501 . 1 | Sequence 6 from patent US 6187310 
Length = 1277 

Score = 27.4 bits (57), Expect = 0.76 

Identities - 12/18 (66%), Positives = 12/18 (66%), Gaps = 3/18 (16%) 

Query: 13 DDCCSRK--CNRFN-KCV 27 

DDC SRK C FN KCV 
Sbjct: 715 DDCNSRKSQCGNFNGKCV 732 



> gi I 14118 2 60 igbl AAE5 94 9 9. 1 | Sequence 3 from patent US 6187310 



4 



Length - 1276 
Score = 25.2 bits (52), Expect = 3.3 

Identities - 11/18 (61%), Positives = 12/18 (66%), Gaps = 3/18 (16%) 

Query: 13 DDCCSRK — CNRFN-KCV 27 

DDC SRK C FN KC+ 
Sbjct: 714 DDCNSRKSQCGNFNGKCI 7 31 



> gi I 14107107 I qb !7VAE55961 . 1 | Sequence 2 from patent US 6165469 
Length = 1295 

Score - 25.2 bits (52), Expect = 3.3 

Identities = 11/18 (61%), Positives - 12/18 (66%), Gaps = 3/18 (16%) 

Query: 13 DDCCSRK — CNRFN-KCV 27 

DDC SRK C FN KC+ 
Sbjct: 729 DDCNSRKSQCGNFNGKCI 74 6 



> gi [14107108 I qb | AAE55962 . 1 | Sequence 3 from patent US 61654 69 
gi|14118259| gb | AAE5 94 98 . 1 | Sequence 2 from patent US 6187310 
Length = 1291 

Score = 25.2 bits (52), Expect = 3.3 

Identities = 11/18 (61%), Positives = 12/18 (66%), Gaps = 3/18 (16%) 

Query: 13 DDCCSRK — CNRFN-KCV 27 

DDC SRK C FN KC+ 
Sbjct: 72 9 DDCNSRKSQCGNFNGKCI 74 6 



> gi I 202 68211 I gb | AAM174 48 . 1 j Sequence 
Length =4 69 

Score -24.8 bits (51), Expect =4.4 
Identities = 12/28 (42%), Positives = 

Query: 6 QKCF-Q HLDDCCSRKC NRFN 

Q+CF Q +LDDC C +RFN 
Sbjct: 33 QRCFCQVSGYLDDC TCDVETIDRFN 



29 from patent US 6361964 

15/28 (53%), Gaps - 12/28 (42%) 

24 

57 



> gi I 157 97 4 74 I emb { CAC8 8 318 . 1 | unnamed protein product [Homo sapiens] 
Length = 596 

Score =24.8 bits (51), Expect - 4.4 
Identities - 7/8 (87%), Positives = 7/8 (87%) 

Query: 8 CFQHLDDC 15 

CFQ LDDC 
Sbjct: 253 CFQILDDC 260 



> gi I 21715796 I emb | CAD38 25 6 . 1 1 unnamed protein product [Homo sapiens] 



5 



Length = 537 
Score =24.8 bits (51), Expect - 4.4 

Identities = 10/19 (52%), Positives = 11/19 (57%), Gaps = 5/19 (26%) 

Query: 11 HLDDCCS RKCNRFN 24 

HL D S RKC+RFN 
Sbjct: 412 HLGDGSSDLILIRKCSRFN 430 



> gi I 21886477 | emb | CAD4 2152 . 1 | unnamed protein product [Homo sapiens] 
Length = 377 

Score =24.4 bits (50), Expect =6.0 
Identities = 8/11 (72%), Positives = 8/11 (72%) 

Query: 12 LDDCCSRKCNR 22 

LDDC SR C R 
Sbjct: 207 LDDCASRPCQR 217 



Score = 18.9 bits (37), Expect = 272 

Identities = 6/10 (60%), Positives = 7/10 (70%), Gaps = 1/10 (10%) 

Query: 13 DDCCSRKCNR 22 

DDC S C+R 
Sbjct: 27 DDCSSH-CDR 35 



> gi {2 302756 1 emb ! GAAP 3 163 . 1 | unnamed protein product [Eimeria maxima] 
gi I 100 6715 6 I qb | AAE4 014 6 . 1 | Sequence 2 from patent US 6001363 
Length = 109 

Score =24.0 bits (49), Expect =8.0 

Identities = 10/19 (52%), Positives = 10/19 (52%), Gaps = 9/19 (47%) 

Query: 7 KCFQHLDDCCSRK--CNRF 23 

KG CCS K CNRF 

Sbjct: 52 KG CCS-KFGCNRF 63 



Score =17.6 bits (34), Expect = 658 
Identities = 4/4 (100%), Positives = 4/4 (100%) 

Query: 15 GGSR 18 
GCSR 

Sbjct: 68 GCSR 71 



Score = 16.3 bits (31), Expect = 1589 
Identities = 4/6 (66%), Positives = 4/6 (66%) 

Query: 15 CGSRKG 20 

GG KG 
Sbjct: 33 GGGSKG 38 



6 



Score - 16.3 bits (31), Expect = 1589 
Identities = 4/6 (66%), Positives = 4/6 (66%) 

Query: 15 CCSRKC 20 

CC KC 
Sbjct: 28 CCCSKC 33 



> gi I 1624 388 3 I qb | AAE8 03 95 . 1 j Sequence 8 from patent US 6284253 
Length =4 98 

Score =24.0 bits (49), Expect =8.0 

Identities = 15/34 (44%), Positives = 16/34 (47%), Gaps = 12/34 (35%) 

Query: 2 RIPNQ — KCFQ HLD-DC-CSRKCNRFNKC 26 

R P Q KCF HL +C RKC NKC 
Sbjct: 4 03 RRPGQSLKCFNCGKPGHLARNCRAPRKC NKC 4 33 



> gi I 4 5307 97 | emb | GAAP 38 2 4 . 1 | unnamed protein product [unidentified] 
gi I 4 5308 07 i emb | CAAQ382 8 . 1 1 unnamed protein product [unidentified] 
Length = 95 

Score = 23.5 bits (48), Expect = 11 

Identities = 12/29 (41%), Positives = 16/29 (55%), Gaps = 13/29 (44%) 

Query: 8 CFQHLD DCCSRKC--NRF NKCV 27 

CF+HLD +C KC N + +KCV 
Sbjct: 20 CFRHLDEREEC KCLLN- YKQEGDKCV 4 4 



> gi [ 34 09411 j qb | AAC3Q4 66 . 1 j 18 94 71 Sequence 1 from patent US 5720959 
Length =48 

Score = 23.5 bits (48), Expect - 11 

Identities = 12/29 (41%), Positives = 16/29 (55%), Gaps = 13/29 (44%) 

Query: 8 CFQHLD DCCSRKC — NRF NKCV 27 

CF+HLD +C KC N + +KCV 
Sbjct: 18 CFRHLDEREEC KCLLN-YKQEGDKCV 4 2 



> gi I 4530805 | emb [CAA03827 . 1 1 unnamed protein product [unidentified] 
gi I 4 530815 I emb ICAA03831 . 1 | unnamed protein product [unidentified] 
Length = 108 

Score = 23.5 bits (48), Expect = 11 

Identities = 12/29 (41%), Positives = 16/29 (55%), Gaps = 13/29 (44%) 

Query: 8 CFQHLD DCCSRKC--NRF NKCV 2 7 

CF+HLD +C KC N + +KCV 
Sbjct: 33 CFRHLDEREEC KCLLN-YKQEGDKCV 57 



7 



> gi|399914 3|gb[AAC92575,l IAR021637 Sequence 3 from patent US 5789198 
Length = 145 

Score = 23.5 bits (48), Expect = 11 

Identities - 10/19 (52%), Positives = 10/19 (52%), Gaps = 5/19 (26%) 

Query: 1 CRIPNQKCFQHLDDCCSRK 19 

CI Q CF DCCS K 
Sbjct: 3 CHIHIQ-CF DCCSMK 16 



> gi I 4 5308 00 I emb | CAA03825 . 1 | unnamed protein product [unidentified] 
gi i 4 530 810 [emb|CAA0382 9. 1 | unnamed protein product [unidentified] 
Length ^ 116 

Score = 23.5 bits (48), Expect = 11 

Identities = 12/29 (41%), Positives = 16/29 (55%), Gaps = 13/29 (44%) 

Query: 8 CFQHLD DCCSRKC--NRF NKCV 27 

CF+HLD +C KG N + +KCV 
Sbjct: 20 CFRHLDEREEC KCLLN-YKQEGDKCV 4 4 



> gi| 34 0 9412 |qb|AAC304 67.1 I 18 94 72 Sequence 2 from patent US 5720959 
Length - 48 

Score - 23.5 bits (48), Expect = 11 

Identities = 12/29 (41%), Positives = 16/29 (55%), Gaps = 13/29 (44%) 

Query: 8 CFQHLD DCCSRKC--NRF NKCV 27 

CF+HLD +C KG N + +KCV 
Sbjct: 18 CFRHLDEREEC KCLLN-YKQEGDKCV 4 2 



> gi I 34 0 9416 1 qb|AAC3Q4 71. 1 | 18 94 7 6 Sequence 12 from patent US 5720959 
Length = 48 

Score = 23.5 bits (48), Expect - 11 

Identities = 12/29 (41%), Positives = 16/29 (55%), Gaps = 13/29 (44%) 

Query: 8 CFQHLD DCCSRKC — NRF NKCV 2 7 

CF+HLD +C KG N + +KCV 
Sbjct: 18 CFRHLDEREEC KCLLN-YKQEGDKCV 4 2 



> gi I 4 530803 I emb j CAA0382 6 . 1 | unnamed protein product [unidentified] 
gi j 4 530813 i emb | CAAQ3830. 1 | unnamed protein product [unidentified] 
Length - 127 

Score = 23.5 bits (48), Expect - 11 

Identities - 12/29 (41%), Positives - 16/29 (55%), Gaps - 13/29 (44%) 

Query: 8 CFQHLD DCCSRKC--NRF NKCV 2 7 

CF+HLD +C KG N + +KCV 
Sbjct: 52 CFRHLDEREEC KCLLN-YKQEGDKCV 7 6 



8 



> gi 120221551 I qb I AAE87124 .11 Sequence 11 from patent US 6326165 
Length ^ 716 

Score - 23.1 bits (47), Expect = 14 
Identities - 6/7 (85%), Positives = 6/7 (85%) 

Query: 8 CFQHLDD 14 

CF HLDD 
Sbjct: 4 39 C'FMHLDD 445 



> gi| 12822272 |qb|AAE4 8713.1 I Sequence 6 from patent US 6107088 
Length = 178 

Score = 23.1 bits (47), Expect - 14 
Identities = 7/14 (50%), Positives = 8/14 (57%) 

Query: 3 IPNQKCFQHLDDCC 16 

IP K F H+ CC 
Sbjct: 160 IPENKYFHHMGKCC 173 



> gi I 2 022154 6 i qb| AAE8 7119. 1 j Sequence 4 from patent US 6326165 
Length = 737 

Score - 23.1 bits (47), Expect = 14 
Identities = 6/7 (85%), Positives = 6/7 (85%) 

Query: 8 CFQHLDD 14 

CF HLDD 
Sbjct: 4 60 CFMHLDD 4 66 



> gi| 12822270 |qb|AAE4 8711,l I Sequence 2 from patent US 6107088 
Length = 317 

Score = 23.1 bits (47), Expect = 14 
Identities - 7/14 (50%), Positives = 8/14 (57%) 

Query: 3 IPNQKCFQHLDDCC 16 

IP K F H+ CC 
Sbjct: 160 IPENKYFHHMGKCC 173 



> gi I 2022154 7 igb|AAE8 712Q. 1 \ Sequence 5 from patent US 6326165 
Length = 716 

Score - 23.1 bits (47), Expect = 14 
Identities = 6/7 (85%), Positives = 6/7 (85%) 



Query: 8 CFQHLDD 14 

CF HLDD 
Sbjct: 439 CFMHLDD 445 
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> qi I 16242616 I gb| AAE79920 . 1 | Sequence 20 from patent US 6280975 
Length = 13 

Score = 22.7 bits (46), Expect - 19 
Identities = 5/7 (71%), Positives - 7/7 (100%) 

Query: 20 CNRFNKC 2 6 

C++FNKC 
Sbjct: 2 CDKFNKC 8 



> qi I 410755 I emb | CAA0Q367 . 1 ! P. 195 [Plasmodium falciparum] 
Length = 1654 

Score - 22.7 bits (46), Expect = 19 

Identities - 8/16 (50%), Positives = 10/16 (62%), Gaps - 6/16 (37%) 

Query: 8 CFQHLD DCCSRKC 20 

CF+HLD +C KC 
Sbjct: 1558 CFRHLDEREEC KC 1570 



Score = 16.8 bits (32), Expect = 1184 
Identities = 4/4 (100%), Positives = 4/4 (100%) 

Query: 22 RFNK 25 
RFNK 

Sbjct: 1332 RFNK 1335 



> gi I 15118 603 | gb { AAE7 0 697 . 1 | Sequence 100 from patent US 6232061 
Length = 520 

Score = 22.3 bits (45), Expect = 26 

Identities = 7/10 (70%), Positives = 7/10 (70%), Gaps = 2/10 (20%) 

Query: 8 CFQH— LDDC 15 

CFQH LD C 
Sbjct: 317 CFQHMILDTC 326 



> gi ! 2333053 9 I qb | AAN2 6530 . 1 | Sequence 2 from patent US 6432408 
Length = 1708 

Score = 22.3 bits (45), Expect = 26 

Identities = 7/10 (70%), Positives - 7/10 (70%), Gaps = 3/10 (30%) 

Query: 12 LDD CCSR 18 

LDD CCSR 
Sbjct: 330 LDDQAFCCSR 339 



> gi I 151417 55 i emb | CAC5Q180 . 1 | Orfl [Hepatitis E virus] 
Length = 1698 
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Score - 22.3 bits (45), Expect - 26 

Identities = 7/10 (70%), Positives = 7/10 (70%), Gaps = 3/10 (30%) 

Query: 12 LDD CCSR 18 

LDD CCSR 
Sbjct: 321 LDDQAFCCSR 330 



> qi I 1254 37 2 I gb | AAA9562 6. 1 | Sequence 6 from patent US 5482928 
gi i 30120 65 I gb I AAC11513. 1 II7592 4 Sequence 6 from patent US 5689048 
Length = 25 

Score = 22.3 bits (45), Expect = 26 

Identities = 8/13 (61%), Positives = 9/13 (69%), Gaps = 1/13 (7%) 

Query: 14 DCCSRKCNRENKC 2 6 

DCC+ CNR KC 
Sbjct: 14 DCCTGSCNR-GKC 25 



> gi i 20227333 I gb | AAE8 937 4 . 1 | Sequence 36 from patent US 6340740 
gi 1 2 3330 622 I gb | AAN2 6602 . 1 | Sequence 31 from patent US 6432628 
Length =47 

Score = 22.3 bits (45), Expect = 26 

Identities = 7/16 (43%), Positives = 9/16 (56%), Gaps = 7/16 (43%) 

Query: 7 KCFQHL DDC 15 

KCFQ+L +DC 
Sbjct: 18 KCFQNLGFEVTVHNDC 33 



> gi I 32 0 990 4 I gb | AAC2084 8 . 1 | 181607 Sequence 2 from patent US 5710019 
Length = 513 

Score =22.3 bits (45), Expect = 26 
Identities = 6/8 (75%), Positives = 6/8 (75%) 

Query: 11 HLDDCCSR 18 

HLD CC R 
Sbjct: 160 HLDGCCKR 167 



> gi i 3 9915 99 I gb | AAC8 502 6 . H AR002 030 
Length = 4 54 4 

Score = 22.3 bits (45), Expect = 
Identities = 10/19 (52%), Positives 

Query: 11 HLDDCCSRKC--NRFN-KC 2 6 

HL CS+KC lSi+F+ KC 
Sbjct: 1230 HLK--CSQKCDQNKFSVKC 124 6 



Sequence 52 from patent US 5739281 
26 

= 13/19 (68%), Gaps = 5/19 (26%) 



Score = 20.2 bits (40), Expect = 113 

Identities = 11/26 (42%), Positives = 14/26 (53%), Gaps = 7/26 (26%) 
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Query: 1 CRIPNQKCFQHLDDCCSRKCNRFNKC 26 

CR NQ+C L S +CN F+ C 
Sbjct: 37 4 9 CR--NQRC L--SSSLRCNMFDDC 37 67 



Score =17.2 bits (33), Expect = 883 
Identities = 4/4 (100%), Positives = 4/4 (100% 

Query: 21 NRFN 2 4 
NRFN 

Sbjct: 4 43 NRFN 44 6 



Score =14.6 bits (27), Expect = 5150 
Identities - 3/4 (75%), Positives = 4/4 (100%) 

Query: 12 LDDC 15 
LD+C 

Sbjct: 2339 LDEC 2342 



Score = 13.4 bits (24), Expect = 12442 
Identities = 4/5 (80%), Positives = 4/5 (80%) 

Query: 17 SRKCN 21 

SR CN 
Sbjct: 89 SRLCN 93 



> gi 1 15116726 i gbj AAE7 0252 . 1 \ Sequence 7 from patent US 6229005 
gi I 21508 67 0 I gb | AAM5 8155 . 1 | Sequence 7 from patent US 6379891 
Length = 1693 

Score =22.3 bits (45), Expect = 26 

Identities = 7/10 (70%), Positives = 7/10 (70%), Gaps = 3/10 (30%) 

Query: 12 LDD CCSR 18 

LDD CCSR 
Sbjct: 330 LDDQAFCCSR 339 



> qi I 14 484 312 I gb j AAE634 97 . 1 | Sequence 1 from patent US 6207416 
gij 17 9056111gblAAE8Q914 .1 1 Sequence 1 from patent US 6287759 
Length = 1693 

Score =22.3 bits (45), Expect = 26 

Identities - 7/10 (70%), Positives = 7/10 (70%), Gaps = 3/10 (30%) 

Query: 12 LDD CCSR 18 

LDD CCSR 
Sbjct: 330 LDDQAFCCSR 339 



> gi I 161352 5 Iqbl AAB165 05. 1 | Sequence 2 from patent US 5559095 
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gi i 18304 59 I gb | AAB4 4 913 . 1 j Sequence 2 from patent US 5587454 

gi I 4000574 igb!AAC94 00 6. HAR024 74 8 Sequence 2 from patent US 5795864 



gi I 5 94 25 69 I qb | AAE02101 , 1 [ 
gi I 5 972562 |gb[7yVE12453. 1 | 
gi I 7223963 I gb | AAE24 130 . 1 | 
gi I 10054 011 | gb| AAE30918 . 1 | 
gi I 10064 28 4 I gb| AAE387 08 . 1 [ 
gi i 12815 62 61 gbl AAE4 54 35. 1 | 
gi I 14 4 77 8 64 ! gb| AAE614 27 . 1 j 
Length - 25 



Sequence 2 from patent US 5859186 
Sequence 2 from patent US 5824 645 
Sequence 2 from patent US 589184 9 
Sequence 2 from patent US 5965534 
Sequence 2 from patent US 5994305 
Sequence 2 from patent US 6087091 
Sequence 2 from patent US 6136786 



Score =22.3 bits (45), Expect = 26 

Identities = 8/13 (61%), Positives = 9/13 (69%), Gaps - 1/13 (7%) 

Query: 14 DCCSRKCNRFNKC 2 6 

DCC+ CNR KC 
Sbjct: 14 DCCTGSCNR-GKC 25 



> gi I 161354 2 I qb | AAB1652 2 . 1 1 Sequence 19 from patent US 5559095 
gi|183Q476| gb | AAB4 4 930. 1 [ Sequence 19 from patent US 5587454 
gil 4 000591 {gb|AAC94 023. 1 IAR02 4 7 65 Sequence 19 from patent US 5795864 
gi I 5 9425 86 I gb | AAE02 118 . 1 | Sequence 19 from patent US 5859186 

Sequence 19 from patent US 5824645 
Sequence 19 from patent US 5891849 



gi I 5 942586 I gb | AAE02 118 . 1 | 
gi I 5 97257 9 Iqb I AAE124 70. 1 . 
gi I 7223 98 0 | qb | AAE2 4 14 7 . 1 | 
gi I 100 64 318 I gb | AAE38 725 . 1 | 
gil 12815 64 3|gb|AAE4 54 52. 1 j 
gi|14477881| qb| AAE61444 . 1 | 
Length = 26 



Sequence 19 from patent US 5994305 
Sequence 19 from patent US 6087091 
Sequence 19 from patent US 6136786 



Score = 21.8 bits (44), Expect = 35 
Identities = 8/13 (61%), Positives = 8/13 (61%) 

Query: 14 DCCSRKCNRFNKC 2 6 

DCCS C R KC 
Sbjct: 14 DCCSGSCGRSGKC 2 6 



> gi ! 1613531 | qb | AAB16511 . 1 | Sequence 8 from patent US 5559095 
gi I 18304 65 ! gb | AAB4 4 91 9 . 1 | Sequence 8 from patent US 5587454 
gi I 4000580 [ gb ' ^ " .,.^^^.^r-. ^ ^ . 

gi I 5942575 | qb , . , 

gi I 5 9725 68 ! qb | AAE124 59 . 1 | 
gi j 7223 969 | qb | AAE24 136 . 1 j 
gi [ 100 64 2 96 I qb | AAE38714 . 1 | Sequence 
gil 12 815632 Iqb iAAE454 41. 1 | Sequence 
gi|14477870| qb | AAE61 4 33 . 1 | Sequence 
Length =26 



I -uw^v^^v^^ ) -^J^i jr:ur^i^ -a -a j_ .y . j. | >4 LA CI ik^ c: (J J_ X. ^JlLl l^aUtiiiL UO -J J O / 4 □ 4 

gi I 4 00058 0|gb| AAC94 012. 1 IAR02 4 754 Sequence 8 from patent US 5795864 
gi I 5 94 2575 | qb | AAEQ2107 . 1 | Sequence 8 from patent US 5859186 
ail5972568iablAAE12459,l I Sequence 8 from patent US 5824645 

Sequence 



from patent US 5891849 
from patent US 5994305 
from patent US 6087091 
from patent US 6136786 



Score - 21.8 bits (44), Expect - 35 
Identities = 8/13 (61%), Positives = 8/13 (61%) 

Query: 14 DCCSRKCNRFNKC 26 

DCCS C R KC 
Sbjct: 14 DCCSGSCGRSGKC 2 6 



> qi I 14 092 871 1 qb | AAE51612 . 1 | Sequence 4 from patent US 6130039 



13 



gi i 180 93808 | emb|CAD20397 . 1 1 unnamed protein product [Homo sapiens] 
Length - 727 

Score = 21.8 bits (44), Expect - 35 

Identities = 7/13 (53%), Positives = 8/13 (61%), Gaps = 5/13 (38%) 

Query: 1 CRIPNQKCFQHLD 13 

CRI FQ+LD 
Sbjct: 204 CRI FQNLD 211 



> gi 14 000083 1 gb | AAC93515 . 1 | AR02 3217 Sequence 55 from patent US 5792846 
qi I 599 999 9[gb|AAE21903. 1 ( Sequence 55 from patent US 5851824 
Length - 1079 

Score = 21.4 bits (43), Expect = 47 
Identities = 5/5 (100%), Positives - 5/5 (100%) 

Query: 8 CFQHL 12 
CFQHL 

Sbjct: 303 CFQHL 307 



> gi I 7 224006 I gb | AAE24173 . 1 | Sequence 2 from patent US 5891857 
gi I 14101230 I gb I AAE5354 7 . 1 | Sequence 2 from patent US 6149903 
gi I 14112 621 j qb!AAE58 038 . 1 | Sequence 2 from patent US 6177410 
gi { 2022 934 3 I gb | AAE904 8 6 . 1 | Sequence 4 9 from patent US 6342483 
Length = 1863 

Score = 21.4 bits (43), Expect = 47 
Identities = 5/5 (100%), Positives = 5/5 (100%) 

Query: 8 CFQHL 12 
CFQHL 

Sbjct: 1225 CFQHL 1229 



Score = 16.8 bits (32), Expect = 1184 
Identities = 4/4 (100%), Positives - 4/4 (100%) 

Query: 24 NKCV 27 
NKCV 

Sbjct: 799 NKCV 802 



> gi I 23342653 | emb | CAD4 8 690 . 1 | unnamed protein product [Homo sapiens] 

gi|23342655| emb | CAD4 8 691 . 1 | unnamed protein product [Homo sapiens] 

gi|23342659| emb | CAD4 8 6 93 . 1 j unnamed protein product [Homo sapiens] 
Length = 457 

Score = 21.4 bits (43), Expect = 47 
Identities = 7/10 (70%), Positives - 7/10 (70%) 

Query: 3 IPNQKCFQHL 12 

IP QKC HL 
Sbjct: 291 IPEQKCNPHL 300 
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> gi I 5 94 68 64 I gb | AAE037 69 . 1 \ 
qi I 5 94 7165 I qb| AAE04 07Q . 1 | 
qi I 5 954 014 | gb | AAEQ 6518 . 1 | 
qi I 100504 08 | gb| AAE27 315 . 1 | 
gi ! 1281634 5 I gb|7VAE4 6154 . 1 | 
Length = 100 



Sequence 20 from patent US 5866542 
Sequence 20 from patent US 5866543 
Sequence 20 from patent US 5872098 
Sequence 20 from patent US 5955294 
Sequence 20 from patent US 6087487 



Score = 21.4 bits (43), Expect = 47 

Identities = 8/15 (53%), Positives - 8/15 (53%), Gaps 



5/15 (33%) 



Query: 12 LDDC- 
LDDC 



-CSRKCN 21 
C KCN 



Sbjct: 35 LDDCGTQKPCEAKCN 4 9 



> gi |400Q084|qb| AAC93516 . 1 | AR023218 Sequence 56 from patent US 5792846 
gi|6000000|gb| AAE219Q4 . 1 | Sequence 56 from patent US 5851824 
Length - 1084 

Score = 21.4 bits (43), Expect = 47 
Identities = 5/5 (100%), Positives = 5/5 (100%) 

Query: 8 CFQHL 12 
CFQHL 

Sbjct: 303 CFQHL 307 



> gi I 32 09905 1 gb | AAC2 084 9. 1 | 181608 Sequence 4 from patent US 5710019 
Length =4 94 

Score = 21.4 bits (43), Expect = 47 
Identities = 5/5 (100%), Positives = 5/5 (100%) 



Query: 12 ' LDDCC 16 
LDDCC 

Sbjct: 123 LDDCC 127 



> gi I 1410034 3 I qb| AAE534 83. 1 | Sequence 4 from patent US 6140485 
Length = 1091 

Score = 21.4 bits (43), Expect - 47 
Identities = 5/5 (100%), Positives = 5/5 (100%) 

Query: 8 CFQHL 12 
CFQHL 

Sbjct: 303 CFQHL 307 



> gi I 5 94 68 96! qb| AAE038 01. 1 I 
gi I 5 947197 ! gb | AAE04102 . 1 [ 
gi| 5 954 04 6 igb I AAEQ 65 50. 1 | 
gi I 100504 40 I gbj AAE27 34 7 . 1 | 
gi| 12816377 | gb ! AAE4 618 6 . 1 | 
Length = 171 



Sequence 62 from patent US 5866542 
Sequence 62 from patent US 5866543 
Sequence 62 from patent US 5872098 
Sequence 62 from patent US 5955294 
Sequence 62 from patent US 6087487 
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Score = 21.4 bits (43), Expect = 47 

Identities = 7/13 (53%), Positives = 7/13 (53%), Gaps = 5/13 (38%) 

Query: 14 DC CSRKCN 21 

DC C RKCN 

Sbjct: 101 DCGNDKQCERKCN 113 



> gi I 15141758 | emb | CAC50182 . 1 | Orfl [Hepatitis E virus] 
Length = 1708 

Score = 21.4 bits (43), Expect = 47 

Identities = 6/10 (60%), Positives = 7/10 (70%), Gaps - 3/10 (30%) 

Query: 12 LDD CCSR 18 

LED CCSR 
Sbjct: 330 LBDQAFCCSR 339 



> gi I 2334 2 657 I emb I CAD4 8 692 . 1 I unnamed protein product [Homo sapiens] 
Length = 457 

Score = 21.4 bits (43), Expect = 47 
Identities = 7/10 (70%), Positives = 7/10 (70%) 

Query: 3 IPNQKCFQHL 12 

IP QKC HL 
Sbjct: 291 IPEQKCNPHL 300 



> gi I 3014 969 1 gbl AAC118 4 9. 1 | 178815 Sequence 2 from patent US 5693473 

gi I 32098 67 | gb | AAC20811 . 1 | 18157 0 Sequence 2 from patent US 5709999 

gi I 320 987 3 I qb | AAC20817 . 1 | 18157 6 Sequence 2 from patent US 5710001 

gi I 3 992 67 6 [ gb I7VAC8 6103 . 1 [ AR005 62 0 Sequence 2 from patent US 5747282 

gi I 3 994 27 0 I gb | AAC8 7 6 98 . 1 | AR008 67 2 Sequence 2 from patent US 5753441 
gi|14478556| gb | AAE617 97 . 1 | Sequence 2 from patent US 6162897 
Length = 1863 

Score =21.4 bits (43), Expect = 47 
Identities = 5/5 (100%), Positives = 5/5 (100%) 

Query: 8 CFQHL 12 
CFQHL 

Sbjct: 1225 CFQHL 1229 



Score = 16.8 bits (32), Expect = 1184 
Identities - 4/4 (100%), Positives = 4/4 (100%) 

Query: 24 NKCV 27 
NKCV 

Sbjct: 799 NKCV 802 



Get selected sequences 



16 



Select all 
Deselect all 



Database: Protein sequences derived from the Patent division of 
GenBank 

Posted date: Nov 25, 2002 1:06 AM 
Number of letters in database: 16,453,681 
Number of sequences in database: 104,350 

Lambda K H 

0.355 0.288 2.12 

Gapped 

Lambda K H 

0.294 0.110 0.610 



Matrix: PAM30 

Gap Penalties: Existence: 9, Extension: 1 
Number of Hits to DB : 912,208 
Number of Sequences: 104350 
Number of extensions: 21292 
Number of successful extensions: 3537 
Number of sequences better than 20000.0: 3253 
Number of HSP's better than 20000.0 without gapping: 3253 
Number of HSP's successfully gapped in prelim test: 0 
Number of HSP's that attempted gapping in prelim test: 0 
Number of HSP's gapped (non-prelim): 3536 
length of query: 27 
length of database: 16,453,681 
effective HSP length: 18 
effective length of query: 9 
effective length of database: 14,575,381 
effective search space: 131178429 
effective search space used: 131178429 
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NR Gen PVIIA Search 



BLASTP 2.2.4 [Aug-26-2002] 

Reference : 

Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database se 
programs", Nucleic Acids Res. 25:338 9-34 02. 

RID: 1038288080-013595-3111 

QuerY= 

(27 letters) 

Database: All non-redundant GenBank CDS 
translations+PDB+SwissProt+PIR+PRF 

1,242,768 sequences; 395,571,179 total letters 

If you have any problems or questions with the results of this searc 
please refer to the BLAST FAQs 

Taxonomy reports 



Distribution of 150 Blast Hits on the Query Sequence 



Score E 

Sequences producing significant alignments: (bits) Value 

qi I 3891872 Ipdbl IKCPl 3d Structure Of K-Conotoxin Pviia, A . . . _45 8e-05 

qi I 7519866|pir 1 I A58997 kappa-conotoxin PVIIA - cone shell (... 8e-05 

qi I 3891964 j pdb | 1AV3 | Potassium Channel Blocker Kappa Conot . . . 45 8e-05 

qi|22002036|splP56633|CXK7 CONPU Kappa-conotoxin PVI lA prec . ___45 8e-05 

qi I 21402496 I ref I NP 658481.11 minC, Bacterial proteins invol . . . 28 15 

qi I 12 656031 |qb| AAK00711. 1 IAF228 515 1 polyprotein [Clitoria . . . __24 275 

qi I 17533597 i ref i NP 495391.11 Putative membrane protein, wit... 23 369 

qi i 310 662 I qb 1 AAA3004 8 . 1 | ETS homologue _23 4 95 

qi I 15603466 i ref ! NP 246540.11 TagI [Pasteurella multocida] >... _23 495 

qi I 99609 Ipir | | S14947 2S albumin - Brazil nut >gi i 17713 | emb | . 23 495 

qi I 18571020 I ref IXP 095628.11 hypothetical protein XP_095628... __23 495 

qi i 23047878 I qb I ZP 00075808.11 hypothetical protein [Methane. . __22 664 

qi I 90454 |pir | | SQ1845 DNA {cytosine-5- ) -methyltransf erase (E... 22 664 

qi I 13878219 I ref I NP 113566.11 ribonuclease/angiogenin inhibi . . . 22 664 
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2 DO y / 1 0 1 go 1 AAC4 (JU ol . 1 1 UNA ( cy tosme-o J -itietnyxt rans teras . . . 


0 0 

z z 


b b 4 


gi 


1 oo3oZ 1 \ gJo 1 AAFoU y bo . 1 | DNA me Lnyl trans lerase [i^^us muscui . . . 


O 0 

z z 


b b4 


qi 


IzOUz / Ul 1 gb 1 A/\b4 oJ/b.iiAbXocio/41 i purat: 1 ve cy tiosine una . . . 


0 0 

zz 


b b4 


gi 


17 6byiy 1 emD 1 CAAJzyiu . 1 1 UNA mernyltirans terase x L^^us mus . . . 


o o 
z z 


bb4 


91 


D/boDD4irer|Nr UJ4xyb.l| UNA metinyxrrans X erase \ cy ros me . . . 


o o 
z z 


bb4 


91 


z44 / b4XU 1 clD3 1 bAOzz! bbU.Xi pxexmoX LiYius muscuxus j 


0 0 

z z 


o y u 


gi 


looybzoojrexINP z4y/JU.X| conserveu nypotne uxcax proiiein . . . 


0 1 
Z X 


X X y b 


qi 


/4J4ozo|pxrj iiXooXU UNA— Qirec L ea una poxyiuerase t niL. z . / . . . . 


9 1 
Z X 


1 1 Q c; 
X X y 3 


qi 


12 72304 1 gb 1 AAB(Jil44 . X 1 aXpna esterase 


0 1 
Z X 


1 1 Q 

X X y o 


qi 


6Jz52Jl|rer!NP UXozyy.Xl mux ti copy suppressor or snxo an... 


O 1 
Z X 


1 X y o 


qi 


2 4 0 o 0 4 o 1 1 re X 1 N r bxuuox.zj obyoXo-rA [urosopnxxa mexanoga... 


O 1 
Z X 


1 1 Q 

X X y D 


gi 


24 58 317 9 1 ref 1 NP 60932/. 2 1 CG31/12-PA [DrosopnxXa mexanog... 


0 1 

Z X 


X X y b 


qi 


234bob31 j gb jZP U0l23yDb.i| nypotnetxcax protexn [ rseuaom. . . 


0 1 
Z X 


1 1 Q R 

X X y o 


qi 


1 9527 983 j gb 1 AALy 0 10 D . 1 1 Aiiyoybp [UrosopnxXa mexanogaster J 


0 1 

Z X 


X X y b 


qi 


2 5 032 / 4 b 1 rex 1 XP 2Ub420.Xl nypotnetxcax protein ap zud4ZU... 


O 1 
Z X 


X X y b 


qi 


/bbXy/OjrexjNP Obo4ob.X| ixXAAUX / o gene prouuct [nomo sap . . . 


O 1 
Z X 


1 1 Q R 

X X y D 


gi 


200/OXbbjrerlNP uuzzz/.z| potassium voxtage— gatea cnanne . . . 


0 1 
Z X 


1 1 Q 

X X y o 


gi 


Xby4b3 /X i emb i LAB y / z / 3 . z 1 cons ervea nypotnetxcax protexn . . . 


9 1 
Z X 


X X y o 


qi 


/oXoZoZlpiri [JLoyiy potas s lum cnanne x x iiuuia.n ^g i i z / o . . . 


9 1 
z ± 


X X y o 


qi 


2 4 94 1 o 0 1 sp 1 y2 / bO / I UPbX ukuiyiXj UNA poxymerase gamma suDunx . . . 


0 n 

Z X 


I 1 Q R 

I I y o 


qi 


X2o4yo / b 1 uD J j dAdZ o o X o . X | riDs X — xiKe ^o, cerevxsiae / ~aat a . . . 


9 1 
Z 1 


1 1 Q R 

X X y o 


gi 


X2biyb43 1 gb 1 AAOb04 33 .XjAhZXbUUo 1 conotoxin sea XX OX a vx/ . . . 


0 1 
Z X 


X X y D 


qi 


1 9 0 / 4 4 3 b 1 rex 1 NP boby4Z.X| sxmxxarity to bUNX-Lir\L ikanoXi, . . 


9 1 

Z X 


X X y b 


qi 


234 7 5177 1 gb 1 ZP 001304bb.l| nypotnetxcax protein [UesuXro... 


O 1 

Z X 


X X y b 


qi 


19527 699 1 gb 1 AAL89964 . 1 1 AT02241p [DrosopnxXa mexanogaster J 


O 1 

Z X 


X X y b 


qi 


1335o405jrer|NP 0/o/4o.l| nypotnetxcax protein [ uympnocy . . . 


O 1 
Z X 


X X y o 


gi 


1 6 94 4 5 65 t emb ! CAC18 18 4 . 2 1 conserved hypothetical protein ... 


O T 

Z X 


X X y b 


gi 


21288 64 7 1 gb 1 EAA0094 0 . 1 1 agCP12/0o L-^i^opheXes gambxae str... 


O 1 

Z X 


X X y b 


qi 


113b9b0b|pxr| | IbOybb nypotnetxcax protein bz 4 P / . x xu [ xmp . . . 


0 1 

Z X 


X X y D 


gi 


25146207 1 rex |NP /4Xb23. X t GXycosyX transferase ( Jo . o KU; . . . 


O 1 

Z X 


X X y b 


qi 


24b44ob4|rer|NP b24Zb/-Z! L-UXZo/ — PA [ urosopnixa mexanoga . . . 


9 1 
Z X 


1 1 Q 

X X y z) 


gi 


7 505021 1 pxr | | T34 006 nypotnetxcax protein H43X0/.Z - Caen . . . 


9 1 
Z X 


X X y b 


gi 


64 / 8 / 64 1 gb 1 AAFljyyi . 1 1 Ah ZOObol 3 coat protein [boyoean m. . . 


9 1 
Z X 


X X y b 


qi 


22055778 1 ref 1 XP 090213.51 hypothetical protein XP 090213... 


2 1 


tine 

1 1 yb 


gi 


21293200 1 gb 1 EAA0534 5 . 1 j agCP9199 [Anopheles gambiae str.... 


O T 

2 1 


1 1 y b 


qi 


21553101 1 ref 1 NP 660128.11 afamin precursor; alpha albumi... 


O 1 

Z 1 


Tine 

1 1 yb 


gi 


15233602 1 ref ! NP 194676.11 putative protein; protein id: ... 


O 1 

2 1 


line 

1 1 yb 


gi 


17136648 1 ref i NP 476821.11 CG8987-PA [Drosophila melanoga... 


2 1 


line 

1195 


qi 


11890594 1 gb 1 AAG41136 . 1 1 polyprotein [Soybean mosaic viru... 


2 1 


1195 


qi 


6478701 1 gb 1 AAF13964 . 1 1 AF200bb4 2 coat protein [boybean m. . . 


O T 

2 1 


1 1 y b 


qi 


22 964283 1 gb { ZP OOOllooy.lj nypotneticaX protein [Knoaops... 


O 1 

Z 1 


1 1 y b 


gi 


200 / 23ob j gb | /\AH2 6bo X . X | Unknown (protexn tor XMAbh : bO b4 4 . . . 


O 1 

Z X 


1 X y b 


qi 


l/lbo3yy|ret|NP 4//oiy.i| wsvzy/ [ snrimp wnite spot syna . . . 


O 1 

Z X 


X xy b 


qi 


2 0 0 4 b y / b 1 re t 1 XP 13/yoX.X| sxmiXar to potassium voitage-g... 


O 1 

Z X 


X X y b 


gi 


15 61 6394 1 ref 1 NP 244699.11 BH3o32'^unKnown [Bacillus naXod. . . 


O T 

2 X 


1 1 yb 


qi 


2009034 1 j ref 1 NP 616416. 1| potassium uptake protexn [Metn. . . 


O 1 

Z X 


1 1 yb 


gi 


2 12 958 4 3 1 gb 1 EAA07 98 8 . 1 1 agCPlb/b [Anopheles gambxae str.... 


2 X 


1 1 yb 


qi 


1435084 1 emb 1 CAA6754 9 . 1 1 zinc finger protein [Ascobolus i... 


O T 

Z X 


Tine 

1 1 yb 


qi 


14 38 5 4 1 I gb i AAB04 09 9 . 1 i sarcoenaoplasmic reticulum Caz+ A... 


O 1 

2 1 


1603 


qi 


18 15 634 1 gb 1 AAC4 4 800 . 1 1 glutamine synthetase type 1 


O 1 

2 1 


T ^ n o 

1603 


qi 


21672065 1 gb 1 AAM74427 . 1 |AC123b94 10 Putative lipid transt... 


O 1 

Z 1 


1 iT n O 

1603 


gi 


19920032 i gb 1 AAM08471 . 1 1 hypothetical protein [ Dictyostel . . . 


2 1 


1603 


gi 


lb224439jret|NP 181342. l[ nypotneticaX protein; protein . . . 


O 1 

Z 1 


1 n 9 
1 bOo 


qi 


67 60665 i gb I AAD4 5537 . 2 j AFlb4 b /o 1 FLASH homolog KlP2b [Ho... 


O 1 

Z 1 


1 ri o 
1603 


qi 


X-3J/DXuy|rex|Nr u/-?U'4Z.Xj nypot ne t X ca X protexn rxiuz-jfiox . . . 


9 1 
Z X 


X DU -J 


qi 


17545702 IreflNP 519104.11 HYPOTHETICAL PROTEIN [Ralstoni... 


~21 


1603 


gi 


18584880 1 ref |XP 102118.11 hypothetical protein XP 102118... 


21 


1603 


qi 


19059811gblAAB50229.1| Gal/GalNAc lectin heavy subunit [... 


21 


1603 


qi 


6681641 1 dbj 1 BAA88816. 1 1 Yb-B19L [Yaba monkey tumor virus] 


21 


1603 


qi 


21233883 1 ref 1 NP 640181.11 hypothetical protein [Proteus ... 


21 


1603 


qi 


18676383 ! emb 1 CAD21521 . 1 1 Brachyury [Podocoryne carnea] 


21 


1603 
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ZUDZ looZ \ CiD^ \ oAAyZoDo , z \ KiAAUio protem [Homo sapiensj 


0 1 
Z 1 


1 n ^ 
1 bU o 


gi 


11288744 Ipirl \i^3odZ nypotnetical protein Bz4Fll . oU [imp. . . 


z 1 


-1 /r n Q 

1 bU J 


gi 


1 61 97 Doo 1 emJo 1 CACo^ y D / . 1 1 putative transposition protein ... 


0 1 

Z 1 


1 bu J 


gi 


13508551 1 emb j CACJolbz . 1 1 TniB protein [AcnromoDacter xyl . . . 


Z 1 


1 n '3 
X bU o 


gi 


20810180 1 go 1 AAH29359 . 1 1 similar to nypotnetical protein ... 


z 1 


1 n T 
X bUo 


gi 


18397365 1 rer 1 NP 566259.11 Lon protease, putative; protei... 


Z 1 


1 bU J 


gi 


2360134 8 1 ref 1 XP 178510.1] nypotnetical protein XP 1/oblO... 


Z 1 


1603 


gi 


2009107 0 1 ref 1 NP 617145. Ij predicted protein [Methanosarc . . . 


Z 1 


X bU 3 


gi 


15920020 1 ref 1 NP j>61080.1i TniB protein [Piasmia pbBlUzJ ... 


Z 1 


X bU3 


gi 


2350 94 2 1 1 ref j NP 702088. li hypothetical protein [Plasmodi... 


Z 1 


T n o 
X bU3 


gi 


158 058 58 1 ref I NP 294556.1! conserved hypothetical protein... 


O 1 

Z 1 


X bUo 


gi 


7 94 9131 1 ref ! NP 058025.11 ATPase, Ca++ transporting, ubiq. . . 


Z 1 


X 603 


gi 


21299840 1 gb 1 EAA11985 . 1 1 agCP8898 [Anopheles gambiae str.... 


Z 1 


X bU3 


qi 


1754 8 954 1 ref 1 NP 522294.11 HYPOI HL I lOAL FKOIii.lN [Kalstoni... 


O 1 

Z 1 


T /r n o 
X bUo 


gi 


69122 8 8 1 ref 1 NP 036247.11 CASP8 associated protein 2; FLI . . . 


21 


X 603 


gi 


212 9X42 / [ go 1 hAAU Jo / z . 1 | agCFi4 z / 4 [ Anopneies gamoiae srr . , . 


Z 1 


X bU J 


gi 


2034 622 9 1 ref 1 XP 109250.11 ATPase, Ca++ transporting, ubi . . . 


Z 1 


T /" A O 

X bU 3 


gi 


75623/61rer|NP (Jobbyo.i! KlAAUyi4 gene proauct [homo sap , . , 


Z 1 


X bU o 


gi 


lzlZZD|Sp|JrlJD/OloijE5j ilLiniLi LilOlJin llD, cXL-LciCfc:J.XUJ.clX ^Cj... 


9 1 

Z. X 


X D U O 


gi 


6753288 i ref 1 NP 036127.11 caspase 8 associated protein 2;... 


~21 


1603 


gi 


23509386 1 ref 1 NP 702053.11 hypothetical protein [Plasmodi... 


21 


1603 


gi 


7439466 1 pir 1 t S71915 hemoglobin, extracellular, chain B 


21 


1603 


gi 


12057020 1 emb i CAC19873 . 1 i putative notch receptor protein... 


21 


1603 


gi 


1389559|qb|AAB02882. 1 1 glycoprotein 330 


21 


1603 


gi 


21672069 1 qb IAAM74431 . 1 IAC123594 14 Putative lipid tranfe... 


21 


1603 



Alignments 

Get selected sequences 
Select all 
Deselect all 



> gi 138 918 72 1 pdb | IKCP | 3d Structure Of K-Conotoxin Pviia, A Novel Potassium 
Channel-Blocking Toxin From Cone Snails, Nmr, 22 
Structures 
Length = 28 

Score - 45.2 bits (99), Expect - 8e-05 
Identities = 19/27 (70%), Positives = 19/27 (70%) 

Query: 1 CXIXNQXCXQHLDDCCSXXCNXXNXCV 2 7 

C I NQ C QHLDDCCS CN N CV 
Sbjct: 1 CRIXNQKCFQHLDDCCSRKCNRFNKCV 27 



> gi i 7519866 i pir 1 IA58997 kappa-conotoxin PVIIA - cone shell (Conus purpurascens ) 
Length = 27 

Score = 45.2 bits (99), Expect = 8e-05 
Identities = 18/27 (66%), Positives = 18/27 (66%) 



Query: 1 CXIXNQXCXQHLDDCCSXXCNXXNXCV 27 

C I NQ C QHLDDCCS CN N CV 
Sbjct: 1 CRIPNQKCFQHLDDCCSRKCNRFNKCV 27 
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> gi 138 91964 1 pdb | 1AV3 | Potassium Channel Blocker Kappa Conotoxin Pviia From C. 
Purpurascens, Nmr, 20 Structures 
Length = 27 

Score = 45.2 bits (99), Expect - 8e-05 
Identities - 19/27 (70%), Positives = 19/27 (70%) 

Query: 1 CXIXNQXCXQHLDDCCSXXCNXXNXCV 27 

C I NQ C QHLDDCCS CN N CV 
Sbjct: 1 CRIXNQKCFQHLDDCCSRKCNRFNKCV 27 



> qi |22Q02036|sp| P56633 | CXK7 CONPU Kappa-conotoxin PVIIA precursor (Fin-popping peptide) 
Length = 72 

Score = 45.2 bits (99), Expect = 8e-05 
Identities = 18/27 (66%), Positives = 18/27 (66%) 

Query: 1 CXIXNQXCXQHLDDCCSXXCNXXNXCV 27 

C I NQ C QHLDDCCS CN N CV 
Sbjct: 4 6 CRIPNQKCFQHLDDCCSRKCNRFNKCV 72 



> gi 1 214 024 96 I ref I NP 658481.11 minC, Bacterial proteins involved in chromosomal 
partioning 

[Bacillus anthracis A2012] 
Length = 228 

Score =27.8 bits (58), Expect = 15 
Identities = 7/7 (100%), Positives = 7/7 (100%) 

Query: 11 HLDDCCS 17 

HLDDCCS 
Sbjct: 21 HLDDCCS 27 



> gi I 12 65 6031 [ qb | AAK0 0711 . 1 | AF22 8 515 1 polyprotein [Clitoria virus Y] 
Length = 489 

Score - 23.5 bits (48), Expect = 275 
Identities = 6/9 (66%), Positives = 7/9 (77%) 

Query: 8 CXQHLDDCC 16 

C QH+D CC 
Sbjct: 197 CAQHIDGCC 205 



> gi I 17533597 i ref I NP 495391.11 Putative membrane protein, with at least 2 transmembrane 
domains 

[Caenorhabditis elegans] 
gi I 7500246 ipir | | T16223 hypothetical protein F31E8.4 - Caenorhabditis elegans 
gi I 12 8 0155 | gbj AAA98Q2Q . 1 | Hypothetical protein F31E8.4 [Caenorhabditis elegans] 
Length = 162 

Score = 23.1 bits (47), Expect' = 369 
Identities = 6/7 (85%), Positives = 6/7 (85%) 
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Query: 10 QHLDDCC 16 

QHL DCC 
Sbjct: 156 QHLTDCC 162 



> qi I 310 662 I gb j AAA3004 8 . 1 [ ETS homologue 
Length - 559 

Score - 22.7 bits (46), Expect = 495 
Identities = 5/6 (83%), Positives = 6/6 (100%) 

Query: 11 HLDDCC 16 

HLD+CC 
Sbjct: 22 HLDECC 27 



> qi i 15603466 I ref i NP 246540.11 TagI [Pasteurella multocida] 
qi I 12721997 I qb | AAK0 3 685 . 1 j TagI [Pasteurella multocida] 
Length = 183 

Score = 22.7 bits (46), Expect = 495 
Identities = 5/6 (83%), Positives = 6/6 (100%) 

Query: 11 HLDDCC 16 

HL+DCC 
Sbjct: 17 6 HLNDCC 181 



> qi I 99609 Ipir | | S14947 28 albumin - Brazil nut 
qi I 17 713 I emb | CAA38 363 . 1 j 2S albumin [Bertholletia excelsa] 
Length - 154 

Score = 22.7 bits (46), Expect = 495 
Identities = 5/6 (83%), Positives = 6/6 (100%) 

Query: 11 HLDDCC 16 

HLD+CC 
Sbjct: 86 HLDECC 91 



> gi i 18571020 I ref I XP 095628.11 hypothetical protein XP_095628 [Homo sapiens] 
Length = 276 

Score = 22.7 bits (46), Expect = 495 
Identities = 6/7 (85%), Positives = 6/7 (85%) 

Query: 11 HLDDCCS 17 

HL DCCS 
Sbjct: 99 HLKDCCS 105 



> gi I 23047878 I gb I ZP 00075808.11 hypothetical protein [Methanosarcina barkeri] 
Length = 175 

Score - 22.3 bits (45), Expect = 664 
Identities = 5/7 (71%), Positives - 7/7 (100%) 
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Query: 10 QHLDDCC 16 

Q+LD+CC 
Sbjct: 155 QNLDECC 161 



> gi I 90454 Ipir | | S01845 DNA ( cytosine-5- ) -methyltransf erase (EC 2.1.1.37) - mouse 
Length - 1573 

Score = 22.3 bits (45), Expect = 664 
Identities = 6/7 (85%), Positives = 6/7 (85%) 

Query: 8 CXQHLDD 14 

C QHLDD 
Sbjct: 242 CGQHLDD 248 



> gi I 13878219 I ref I NP 113566.11 ribonuclease/angiogenin inhibitor 2 [Mus musculus] 

gi I 1360384 5 I gb | AAK319 60 . 1 | AF28 5581 1 ribonuclease/angiogenin inhibitor 2 [Mus 
musculus] 

Length = 748 
Score = 22.3 bits (45), Expect - 664 

Identities = 8/11 (72%), Positives - 8/11 (72%), Gaps = 1/11 (9%) 

Query: 3 IXNQXCXQHLD 13 

I NQ C QHLD 
Sbjct: 453 IQNQ-CLQHLD 4 62 



> gi I 268 9716 { gb | AAC4 0061 . 1 | DNA ( cytosine-5 ) -methyitrans f erase [Mus musculus] 
Length = 1502 

Score =22.3 bits (45), Expect - 664 
Identities = 6/1 (85%), Positives = 6/7 (85%) 

Query: 8 CXQHLDD 14 

C QHLDD 
Sbjct: 243 CGQHLDD 249 



> gi I 7 339827 | gb | AAF60 965 . 1 | DNA methyltransf erase [Mus musculus] 
gi i 971924 9 I gb | AAF97 695 . 1 | DNA ( cytosine-5 ) -methyltransf erase [Mus musculus ] 
Length = 1502 

Score =22.3 bits (45), Expect = 664 
Identities = 6/7 (85%), Positives = 6/7 (85%) 

Query: 8 CXQHLDD 14 

C QHLDD 
Sbjct: 244 CGQHLDD 250 



> gi I 12002701 |gb|AAG4 337 6. 1 iAF15587 4 1 putative cytosine DNA methyltransf erase [Oryza 
sativa] 

Length = 1501 
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Score - 22.3 bits (45), Expect = 664 
Identities - 6/7 (85%), Positives = 6/7 (85%) 

Query: 8 CXQHLDD 14 

C QHLDD 
Sbjct: 243 CGQHLDD 249 



> qi i 17 65 919 I emb j CAA32 910 . 1 | DNA methyitransf erase 1 [Mus musculus] 
Length - 1620 

Score - 22.3 bits (45), Expect - 664 
Identities = 6/7 (85%), Positives = 6/7 (85%) 

Query: 8 CXQHLDD 14 

C QHLDD 
Sbjct: 361 CGQHLDD 367 



> gi I 6753664 I ref I NP 034196.11 DNA methyitransf erase (cytosine-5) [Mus musculus] 

gii20141336|sp|P13864 IDNMl MOUSE DNA ( cytosine-5 ) -methyltrans f erase 1 (Dnmtl) (DNA 
methyitransf erase 

Mmul) (DNA MTase Mmul) (MCMT) (M.MmuI) (Met-1) 
gi I 6625 68 7 | gb | AAF19352 . 1 | DNA methyltrans f erase [Mus musculus] 
Length = 1620 

Score =22.3 bits (45), Expect = 664 
Identities - 6/7 (85%), Positives = 6/7 (85%) 

Query: 8 CXQHLDD 14 

C QHLDD 
Sbjct: 362 CGQHLDD 368 



> gi I 24 4 7 54 10 I dbj IBAC2 2 6 60. 1| plexinBl [Mus musculus] 
Length - 2119 

Score = 21.8 bits (44), Expect = 890 

Identities - 7/10 (70%), Positives = 7/10 (70%), Gaps - 1/10 (10%) 

Query: 8 CXQHLD-DCC 16 

C QHLD D C 
Sbjct: 482 CAQHLDCDSC 4 91 



> gi{ 15596236|ref |NP 249730.11 conserved hypothetical protein [Pseudomonas aeruginosa 
PAOl] 

gi I 11347735 [pir I IG83515 conserved hypothetical protein PA1039 [imported] - Pseudomonas 

aeruginosa (strain PAOl) 
gij 994 6952 [gb|AAG04 4 28 . 1 I AE004 536 7 conserved hypothetical protein [Pseudomonas 
aeruginosa PAOl] 

Length - 157 

Score = 21.4 bits (43), Expect = 1195 
Identities - 5/5 (100%), Positives = 5/5 (100%) 
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Query: 12 LDDCC 16 

LDDCC 
Sbjct: 14 LDDCC 18 



> gi I 7434825 Ipir i IT13810 DNA-direct ed DNA polymerase (EC 2.7.7.7) gamma, mitochondrial - 
fruit fly (Drosophila melanogaster) 
gii2289912|gb | AAC4 7 658 . 1 | mitochondrial DNA polymerase catalytic subunit precursor 
[Drosophila melanogaster] 
Length = 1145 

Score = 21.4 bits (43), Expect = 1195 
Identities = 5/5 (100%), Positives = 5/5 (100%) 

Query: 12 LDDCC 16 
LDDCC 

Sbjct: 662 LDDCC 666 



> gi I 12 7 2304 I gb | AAB01144 . 1 | alpha esterase 
Length = 548 

Score - 21.4 bits (43), Expect = 1195 
Identities = 5/6 (83%), Positives = 6/6 (100%) 

Query: 10 QHLDDC 15 

+HLDDC 
Sbjct: 273 EHLDDC 278 



> gi I 6325231 { ref i NP 015299.11 multicopy suppressor of snf3 and grrl mutants; Skslp 
[ Saccharomyces 

cerevisiae] 

gi 1 2499625 I sp [Q125Q5 I SKSl YEAST Serine/threonine-protein kinase SKSl (Suppressor 
kinase of SNF3) 

gi I 2133138 I pir I IS61935 SKSl protein - yeast (Saccharomyces cerevisiae) 
gi| 10394 5Hgb IAAB68161.il Sha3p 
gil 1173541 Igb IAAC4 957 0.il Skslp 
Length - 502 

Score = 21.4 bits (43), Expect = 1195 
identities = 5/7 (71%), Positives = 1/1 (100%) 

Query: 11 HLDDCCS 17 

H+D+CCS 
Sbjct: 243 HVDECCS 249 



> gi I 24585481] ref |NP 610051.21 CG9316-PA [Drosophila melanogaster] 
gi I 22 94 6930 | qb I AAF53915 . 2 | CG9316-PA [Drosophila melanogaster] 
Length = 448 

Score =21.4 bits (43), Expect = 1195 
Identities = 5/5 (100%), Positives = 5/5 (100%) 



Query: 12 LDDCC 16 
LDDCC 



9 



Sbjct: 236 LDDCC 240 



> gi I 24583179 I ref I NP 609327.21 CG31712-PA [Drosophila melanogaster ] 
gi 1 22 94 6070 | gb | AAF52833 . 2 | CG31712-PA [Drosophila melanogaster] 
Length = 309 

Score = 21.4 bits (43), Expect - 1195 
Identities - 5/5 (100%), Positives = 5/5 (100%) 

Query: 12 LDDCC 16 
LDDCC 

Sbjct: 216 LDDCC 220 



> gi 1234686311 gblZP 00123966.11 hypothetical protein [ Pseudomonas syringae pv. syringae 
B728a] 

Length = 610 

Score - 21.4 bits (43), Expect - 1195 
Identities = 5/5 (100%), Positives = 5/5 (100%) 

Query: 12 LDDCC 16 
LDDCC 

Sbjct: 304 LDDCC 308 



> qi I 19527 98 3 i gb | AAL9010 6 . 1 | AT19096p [Drosophila melanogaster] 
Length = 4 47 

Score - 21.4 bits (43), Expect = 1195 
Identities = 5/5 (100%), Positives = 5/5 (100%) 

Query: 12 LDDCC 16 
LDDCC 

Sbjct: 234 LDDCC 238 



> qi I 25032745 1 ref ! XP 206420.1) hypothetical protein XP_206420 [Mus musculus] 
Length - 118 

Score = 21.4 bits (43), Expect = 1195 
Identities - 5/6 (83%), Positives - 5/6 (83%) 

Query: 10 QHLDDC 15 

HLDDC 
Sbjct: 10 HHLDDC 15 



> gi i 7661970 I ref I NP 055455.11 KIAA0173 gene product [Homo sapiens] 
gi I 20455377 { sp IQ14679 I Y173 HUMAN Hypothetical protein KIAA0173 

gi| 1136406! dbj I BAA114 90 . 1 I similar to pig tubulin-tyrosine ligase. [Homo sapiens] 
Length - 1199 

Score - 21.4 bits (43), Expect - 1195 
Identities - 5/6 (83%), Positives = 6/6 (100%) 
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Query: 12 LDDCCS 17 

L+DCCS 
Sbjct: 519 LEDCCS 524 



> gi 1200701661 re f I NP 002227.21 potassium voltage-gated channel, subfamily F, member 1; 
potassium 

channel KHl [Homo sapiens] 
qi I 24418476 I sp [ Q9H3M0 I KCFl HUMAN Potassium voltage-gated channel subfamily F member 1 
(Volt age -gated 

potassium channel protein Kv5.1) (kHl) 
qi 1120018241 gb | AAG4 3Q55 . 1 | potassium channel [Homo sapiens] 

gi 1199134911 gb I AAH2 6110 . 1 | potassium voltage-gated channel, subfamily F, member 1 
[Homo 

sapiens] 
Length =4 94 

Score - 21.4 bits (43), Expect = 1195 
Identities - 5/5 (100%), Positives = 5/5 (100%) 

Query: 12 LDDCC 16 
LDDCC 

Sbjct: 123 LDDCC 127 



> gi I 16945371 | emb | CAB 972 7 5 . 2 | conserved hypothetical protein [Neurospora crassa] 
Length = 1405 

Score - 21.4 bits (43), Expect = 1195 
Identities = 5/6 (83%), Positives = 6/6 (100%) 

Query: 10 QHLDDC 15 

+HLDDC 
Sbjct: 1182 EHLDDC 1187 



> gi i 7513252 I pir I |JC5919 potassium channel . 1 - human 
gi I 27 39501 I qb| AAC055 97 . 1 | potassium channel [Homo sapiens] 
Length - 4 94 

Score = 21.4 bits (43), Expect = 1195 
Identities = 5/5 (100%), Positives = 5/5 (100%) 

Query: 12 LDDCC 16 
LDDCC 

Sbjct: 123 LDDCC 127 



> gi j 2494180 I sp I Q27607 I DPGl DROME DNA polymerase gamma subunit 1, mitochondrial 
precursor 

(Mitochondrial DNA polymerase catalytic subunit) 
gi I 7447799 Ipir | j T13808 DNA-directed DNA polymerase (EC 2.7.7.7) gamma chain - fruit 
fly 

(Drosophila melanogaster ) 
gi I 1401344 | gb | AAC4 72 90 . 1 | DNA polymerase gamma 
Length == 1145 



11 



Score =21.4 bits (43), Expect = 1195 
Identities - 5/5 (100%), Positives = 5/5 (100%) 

Query: 12 LDDCC 16 
LDDCC 

Sbjct: 662 LDDCC 666 



> gi I 128 4 9875 I dbj | BAB28515. 1 | Hbsl-like (S. cerevisiae ) -data sourcerMGD, source 
key:MGI : 1891704, 

evidence : ISS-'putative [Mus musculus] 
Length = 211 

Score =21.4 bits (43), Expect = 1195 

Identities = 7/13 (53%), Positives = 7/13 (53%), Gaps - 6/13 (46%) 

Query: 11 HLD DCCS 17 

HLD DCCS 
Sbjct: 163 HLDSSSKPFDCCS 175 



> gi I 12 61954 3 I qb | AAG604 33 . 1 | AF215005 1 conotoxin scaffold VI/VII precursor [Conus 
ventricosus ] 

Length = 74 

Score - 21.4 bits (43), Expect - 1195 

Identities = 7/10 (70%), Positives = 7/10 (70%), Gaps = 1/10 (10%) 

Query: 8 CXQHLDDCCS 17 

C QH DCCS 
Sbjct: 55 CTQH-SDCCS 63 



> gi I 19074436 I ref [NP 585942. 1| similarity to GCNl-LIKE TRANSLATIONAL ACTIVATOR 
[Encephalitozoon 

cuniculi ] 

gi I 190 69078 | emb | CAD2 554 6 . 1 | similarity to GCNl-LIKE TRANSLATIONAL ACTIVATOR 
[Encephalitozoon 

cuniculi ] 
Length = 1323 

Score = 21.4 bits (43), Expect - 1195 
Identities = 5/6 (83%), Positives = 6/6 (100%) 

Query: 12 LDDCCS 17 

LD+CCS 
Sbjct: 779 LDECCS 784 



> gij23475177|gb|ZP 00130466.11 hypothetical protein [ Desuif ovibrio desulf uricans G20] 
Length = 589 

Score = 21.4 bits (43), Expect = 1195 
Identities = 5/7 (71%), Positives = 6/7 (85%) 



Query: 10 QHLDDCC 16 
Q LD+CC 
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Sbjct: 203 QRLDECC 209 



> gi I 19527 699 | gb| AAL8 9964 . 1 | AT02241p [Drosophila melanogaster ] 
Length - 1145 

Score = 21.4 bits (43), Expect = 1195 
Identities = 5/5 (100%), Positives - 5/5 (100%) 

Query: 12 LDDCC 16 
LDDCC 

Sbjct: 662 LDDCC 666 



> gi I 133584 05 I ref I NP 078748.11 hypothetical protein [Lymphocystis disease virus 1] 
Length - 1085 

Score - 21.4 bits (43), Expect - 1195 
Identities = 5/5 (100%), Positives = 5/5 (100%) 

Query: 12 LDDCC 16 
LDDCC 

Sbjct: 825 LDDCC 829 



> gi j 16944565 } emb | CAC18184 . 2 | conserved hypothetical protein [Neurospora crassa] 
Length = 590 

Score - 21.4 bits (43), Expect = 1195 
Identities - 5/6 (83%), Positives = 6/6 (100%) 

Query: 10 QHLDDC 15 

+HLDDC 
Sbjct: 374 EHLDDC 379 



> gi I 2128 8 647 | gb | EAA0094 0 . 1 j agCP12708 [Anopheles gambiae str. PEST] 
Length ^ 552 

Score = 21.4 bits (43), Expect - 1195 
Identities = 5/5 (100%), Positives = 5/5 (100%) 

Query: 12 LDDCC 16 
LDDCC 

Sbjct: 207 LDDCC 211 



> gi [ 11359505 1 pir I IT50956 hypothetical protein B24P7.110 [imported] - Neurospora crassa 
Length =97 9 

Score = 21.4 bits (43), Expect = 1195 
Identities = 5/6 (83%), Positives - 6/6 (100%) 



Query: 10 QHLDDC 15 

+HLDDC 
Sbjct: 756 EHLDDC 761 
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> qi I 25146207 I ref 1 NP 741523.11 Glycosyl transferase (38.8 kD) [Caenorhabditis elegans] 

qi I 17 4 02836 I qb | AAL38 962 . 1 | AF125 965 3 Hypothetical protein H43I07.3 [Caenorhabditis 
elegans ] 

Length - 339 

Score - 21.4 bits (43), Expect = 1195 
Identities = 5/5 (100%), Positives = 5/5 (100%) 

Query: 12 LDDCC 16 
LDDCC 

Sbjct: 100 LDDCC 104 



> qi I 24644854 I ref I NP 524267.21 CG1257-PA [Drosophila melanogaster ] 

qi I 17 8 62822 I gb | AAL3988 8 . 1 | LP07235p [Drosophila melanogaster] 

qi I 2317 057 5 I qb | AAF54QQ4 . 2 | CG1257-PA [Drosophila melanogaster] 

qi I 2 5012 4 62 I qb | AAN7133 6. 1 | RE24420p [Drosophila melanogaster] 
Length - 543 

Score = 21.4 bits (43), Expect = 1195 
Identities - 5/6 (83%), Positives = 6/6 (100%) 

Query: 10 QHLDDC 15 

+HLDDC 
Sbjct: 273 EHLDDC 278 



> qi I 7505021 Ipir I IT34006 hypothetical protein H43I07.2 - Caenorhabditis elegans 
Length = 697 

Score = 21.4 bits (43), Expect = 1195 
Identities = 5/5 (100%), Positives = 5/5 (100%) 

Query: 12 LDDCC 16 
LDDCC 

Sbjct: 100 LDDCC 104 



> gi I 64 7 87 64 I gb | AAF13991 . 1 1 AF200581 3 coat protein [Soybean mosaic virus] 
Length - 277 

Score = 21.4 bits (43), Expect - 1195 
Identities - 5/6 (83%), Positives = 6/6 (100%) 

Query: 11 HLDDCC 16 

H+DDCC 
Sbjct: 1 HVDDCC 6 



> gi I 22055778 1 ref ! XP 090213.51 hypothetical protein XP_090213 [Homo sapiens] 
Length = 705 

Score = 21.4 bits (43), Expect = 1195 
Identities = 5/5 (100%), Positives = 5/5 (100%) 
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Query: 12 LDDCC 16 
LDDCC 

Sbjct: 249 LDDCC 253 



> qi I 212 932 00 I qb I EAA05 34 5. 1 j agCP9199 [Anopheles gambiae str. PEST] 
Length = 1129 

Score = 21.4 bits (43), Expect = 1195 
Identities = 5/5 (100%), Positives = 5/5 (100%) 

Query: 12 LDDCC 16 
LDDCC 

Sbjct: 331 LDDCC 335 



> qi I 21553101 1 ref I NP 660128.11 afamin precursor; alpha albumin [Mus musculus] 
qi I 5902744 I sp I 089020 I AFAM MOUSE Afamin precursor (Alpha-albumin) (Alpha-Alb) 
qi 136463611 emb j CAA0 9471 . 1 | alpha-albumin protein [Mus musculus] 
Length = 611 

Score = 21.4 bits (43), Expect = 1195 
Identities = 5/6 (83%), Positives - 6/6 (100%) 

Query: 12 LDDCCS 17 

L+DCCS 
Sbjct: 381 LEDCCS 386 



Get selected sequences 
Select all 
Deselect all 



Database: All non-redundant GenBank CDS 
translations+PDB+SwissProt+PIR+PRF 

Posted date: Nov 25, 2002 1:38 AM 
Number of letters in database: 395,571,179 
Number of sequences in database: 1,242,768 

Lambda K H 

0.357 0.284 2.14 

Gapped 

Lambda K H 

0.294 0.110 0.610 



Matrix: PAM30 

Gap Penalties: Existence: 9, Extension: 1 

Number of Hits to DB : 8,008,999 

Number of Sequences: 1242768 

Number of extensions: 61346 

Number of successful extensions: 1959 

Number of sequences better than 20000.0: 1939 

Number of HSP's better than 20000.0 without gapping: 1939 
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Number of 
Number of 
Number of 
length of 
length of 
effective 
effective 
effective 
effective 
effective 
T: 11 
40 
14 ( 7 
35 (14 
58 (24 
34 (19 
34 (17 



HSP's successfully gapped in prelim test: 0 
HSP's that attempted gapping in prelim test 
HSP's gapped (non-prelim): 1959 
query: 27 

database: 395, 571, 179 

HSP length: 18 

length of query: 9 

length of database: 373,201,355 

search space: 3358812195 

search space used: 3358812195 



A: 

XI 
X2 
X3 
SI 
32 



.2 bits) 
.8 bits) 
. 6 bits) 
.3 bits) 
. 6 bits) 



Pat Gen PVIIA Search 



BLASTP 2.2.4 [Aug-26-2002] 



Reference : 

Altschul, Stephen F. , Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search 
programs". Nucleic Acids Res. 25:338 9-34 02. 



RID: 1038288110-013776-17050 



Query= 

(27 letters) 



Database: Protein sequences derived from the Patent division of 
GenBank 

104,350 sequences; 16,453,681 total letters 



If you have any problems or questions with the results of this search 
please refer to the BLAST FAQs 



Taxonomy reports 



Distribution of 162 Blast Hits on the Query Sequence 









Score 


E 


Sequences producing significant alignments: 


(bits) 


Value 




320 9905 1 qb 1 AAC2084 9. 1 I 18 


1608 Sequence 4 from patent US 5. . . 


21 


47 


gi 


14118421 iqbjAAE59660. 1 I 


Sequence 90 from patent US 6187548 


21 


63 


gi 


14118419! qbl AAE5 9658 . 1 1 


Sequence 86 from patent US 6187548 


21 


63 


gi 


14118 4 20 |qb|AAE5 9659. 1 | 


Sequence 88 from patent US 6187548 


21 


63 


gi 


1512 4 758 ! qbl AAE7 32 19. 1 | 


Sequence 84 from patent US 6239270 


21 


63 


gi 


14118418! qb 1 AAE59657 . 1 | 


Sequence 84 from patent US 6187548 


21 


63 


gi 


59508 02 1 qbl AAE0502 8 . 1 | 


Sequence 2 from patent US 5869282... 


21 


84 


gi 


3209 90 4 1 qbl AAC20 84 8 . 1 | 18 


1607 Sequence 2 from patent US 5. . . 


21 


84 


gi 


214 387 96 1 emb | CAD34 7 8 5 . 1 | 


unnamed protein product [Homo s... 


20 


113 


gi 


3997978 |qb|AAC91410. 1 IAR017520 Sequence 2 from patent US... 


20 


113 


gi 


18 67 4 819|emb!CAD232 95. 1 | 


unnamed protein product [Homo s... 


20 


151 


gi 


12 82 4 4 50 1 qbl AAE4 9356. 1 | 


Sequence 1 from patent US 6110686 


20 


151 


gi 


15121659|gb! AAE72339. 1 I 


Sequence 9 from patent US 623570... 


20 


151 


gi 


2150 8 815 1 qbl AAM5 8270. 1 | 


Sequence 1 from patent US 6379925 


20 


151 


gi 


1831307|qb|AAB45761.1| 


Sequence 13 from patent US 5591821 


19 


203 



SI 
si 



si 
si 
ai 

ai 
al 
ai 
al 
ai. 
ai 
ai. 
ai 
ai 
ai 
ai. 
ai. 
ai 
ai. 
ai. 
ai 
ai 
ai 
ai 
ai 
ai 
ai 
ai 
ai 
ai 
ai 
ai 
ai 
ai 
ai 
ai 
ai 
ai 
ai 
ai 
ai 
ai 
ai 
ai 
ai 
ai 
ai 
ai 
ai 
ai 
ai 
ai 
ai 
ai 
ai 
ai 
ai 
ai 
ai 



1254 37 2 i Qb I AAA95 62 6. 1 j 



1831303|gb|AAB45757.1| 



512010 I emb | CAA01442.1i 



1613 52 5 1 qb I AAB16505. 1 | 



Sequence 6 from patent US 5482928... 
Sequence 4 from patent US 5591821 
ovine IL-1 beta [Ovis sp . ] 
Sequence 2 from patent US 5559095... 
21065816 I emb | CAD3204 6. 1 1 unnamed protein product [synthe... 

14110683 I qb I AAE57223.1 | Sequence 6 from patent US 617181... 
4 52 6603 I emb | C7\A0 04 57 . 1 | MAb MBrl VK sequence [synthetic . . . 
1410 37 31 i qb| AAE55165 . 1 { Sequence 8 from patent US 6153740 
141037 28 I qb I AAE55162 . 1 j Sequence 4 from patent US 6153740 
14110681 I qb I AAE57221 . 1 | Sequence 2 from patent US 617181... 

14110684 I qb| AAE572 24 . 1 1 Sequence 7 from patent US 6171816 
12 813715 I qb| AAE4 5015 . 1 | Sequence 20 from patent US 6083904 
141037 30 I qb I AAE5 5164 . 1 \ Sequence 6 from patent US 6153740 
1410372 9 jqb I AAE55 163.1 | Sequence 5 from patent US 6153740 
270588 |gb|AAA02097.1 | Sequence 2 from Patent US 4879374 
3994136 I qb I AAC87 5 64 . 1 1 AR007 930 Sequence 17 from patent U. . . 



14110685 I qb I AAE57225 . 1 j Sequence 8 from patent US 6171816 
18 3048 6 I qb j AAB4 4 94 0 . 1 1 Sequence 29 from patent US 558745... 
141197 03 I qb I AAE60017 . 1 1 Sequence 8 from patent US 619087... 
3011710 I qb I AAC11158 . 1 1 17 55 69 Sequence 3 from patent US 5... 



1613 54 2 1 qb I AAB16522 . 1 | Sequence 19 from patent US 555909. 
1831302 I qb I AAB4 575 6. 1 1 Sequence 3 from patent US 5591821 
27 25130 I qb I AAB92 812 . H 167150 Sequence 21 from patent US . 



4 7741931 emb j CAB42495 . 1 | unnamed protein product [unident.., 
4134 31 I embi CAA007 67 . 1 | 0RF2 [synthetic construct] 
1004 95 38 I qb| AAE2 64 45 . 1 | Sequence 13 from patent US 5948900 
12 807 911 I qb ! AAE4 3131 . 1 | Sequence 4 from patent US 6074840 
30117 09 I qb I AAC11157 . 1 |I7 55 68 Sequence 2 from patent US 5... 



21065678 1 emb | CAD32 037 . 1 | unnamed protein product [Homo s... 
2Q241673|qb|AAE92344 .1 | Sequence 338 from patent US 6348328 
3 9 94135 I qb I AAC87 5 63 . 1 | AR007 92 9 Sequence 16 from patent U... 



3011708 I qbl AAC1115 6. 1 j 17 55 67 



14 0 97 4 64 I qb IAAE523Q6.il 



18 674249 I emb | CAD23272 , 1 | unnamed protein product [Homo s.. 
3994138 I qb i AAC87 5 66. H AR007 932 Sequence 19 from patent U. . 

Sequence 1 from patent US 5.. 
Sequence 5 from patent US 6133232 
20241956 I qb I AAE92627 . 1 | Sequence 6 from patent US 6348348 
12 711156 I embI CAC28 4 93. 1 j At position 11, R is a purine. .. 

Sequence 8 from patent US 5559095.. 

ANTI FREEZE PROTEIN [Daucus carota] 
Sequence 21 from patent US 558745.. 
Sequence 2 from patent US 5597707.. 
Sequence 9 from patent US 5591821 
Sequence 10 from patent US 63070.. 
39 9575 9 I qb | AAC8 918 7 . 1 | AR01330 9 Sequence 3 from patent US.. 



1613531!qblAAB16511.1i 



67 4 07 90 I embI CAB694 52 . 1 i 



1830478 |qbiAAB44932.i: 



1831725|qblAAB46179.1. 



1831305 I qb! AAB4 575 9. 1 1 



17915749 jqb|AAE84475.H 



5 94 68 6H qb i AAE037 6 6. 1 I 



5946859|qb|AAE03764.1| 



16223597 I qbl AAE7 4 94 9.1 I 



24 8 9762 I qb| AAB7 7 8 98 . 1 | 159399 



17 9157 47 I gb| AAE8 44 73. 1 I Sequence 8 from patent US 630701.. 
5 94 68 64 | qb | AAE037 69 . 1 [ Sequence 20 from patent US 586654.. 
197 01037 I emb j CAD2 904 2 . 1 j unnamed protein product [Homo s.. 

Sequence 7 from patent US 5866542.. 
Sequence 4 from patent US 5866542.. 
Sequence 2 from patent US 6251655 
Sequence 4 from patent US 5.. 
10053997 I qbl AAE30 90 4 .1 | Sequence 3 from patent US 596552.. 
5 94162 I qb I AAA5 57 69. 1 j Sequence 17 from Patent EP 0324648 
1004 9907 I qb I AAE2 6814 . 1 | Sequence 12 from patent US 59522.. 
14106039 I qb I AAE55 654 . 1 | Sequence 23 from patent US 6160088 

Sequence 2 from patent US . . 
unnamed protein product [Homo s.. 
Sequence 2 from patent US 6416734 
Sequence 2 from patent US 6251636 
Sequence 2 from patent US 596552.. 
Sequence 8 from patent US 5 . . 



4 001198 I qbl AAC94630. 1 i AR025722 



2133 63 60 I emb | CAD334 48 . 1 | 



23316741 I qb I AAN212 65 . 1 I 



162234 78 I qb | AAE74 87 4 . 1 | 



10053996 I qb I AAE30 903. 1 | 



24 8410 9|qb|AAB7224 5. 1 II4 9138 
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3 



21538090 I emb | CAD35 982 . 1 : 



12829122|qb|AAE50822,l 



9 997 8 07 I emb 1CAC0754 8 . 1 



2 332 9052 I qb I AAN2 5 969 . 1 



2 14 38806 I emb I CAD34 7 90.1 



202 41852 [ qbl AAE92523. 1 I 



prepro-PAPP-A2 coding sequence . . 
Sequence 2 from patent US 6127159 
unnamed protein product [Mus mus . . 
Sequence 22 from patent US 6428967 
unnamed protein product [Homo s.. 
Sequence 517 from patent US 6348328 
3211666 I qbl AAC2 1426. 1 | 183369 Sequence 16 from patent US 
1018833H emb I CAC0 9134 . iT unnamed protein product [Zea mays] 
399914 3 i qbl AAC9257 5. 1 | AR021637 Sequence 3 from patent US.. 
10054000 I qb I AAE30 9Q7 . 1 1 Sequence 6 from patent US 596552.. 
2 332 9055 I qb | AAN2 5 97 2 . 1 | Sequence 25 from patent US 6428967 
2332 9053 | qb I AAN2 5 97 0 . 1 | Sequence 23 from patent 
320874 9 Iqbl AAC20330. 1 | 1804 59 Sequence 



10Q57863|qb!AAE34769.1| 



157 974 7 4 | emb ! CAC88318 . 1 | 



2332 9054 | qb I AAN25 97 1 . 1 | 



6733930 I emb | CAB69362 . 1 



2 332 9051 I qb I AAN2 5 968 . 1 I 

3 999808 Iqbl AAC9324 0. II AR022 94 2 



100 4 990 6 I qb I AAE2 6813. 1 | 



12811284 I qb I AAE4 4 2 97 . 1 | 



5 9538 50 Iqbl AAEO 6354 . 1 



160 692 3 I Qbl AABO 98 80. 1 I 



3717241 |emb|CAAQ3733. 1 | 



US 6428967 
4 from patent US 5. . . 
Sequence 6 from patent US 5977442 

unnamed protein product [Homo s.., 
Sequence 24 from patent US 6428967 
unnamed protein product [unident.., 
Sequence 21 from patent US 6428967 
Sequence 9 from patent US.. 
Sequence 10 from patent US 59522.. 
Sequence 5 from patent US 608054.. 
Sequence 5 from patent US 5871964.. 
Sequence 4 from patent US 5491075.. 
unnamed protein product [Senna o. . 
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Alignments 

Get selected sequences 
Select all 
Deselect all 



> qi I 3209905 I qbl AAC2 08 4 9. 1 | 181608 Sequence 4 from patent US 5710019 
Length - 494 

Score =21.4 bits (43), Expect = 47 
Identities = 5/5 (100%), Positives - 5/5 (100%) 

Query: 12 LDDCC 16 
LDDCC 

Sbjct: 123 LDDCC 127 



> qi 114 1184211 qb I7VAE5966Q . 1 j Sequence 90 from patent US 6187548 
Length = 4655 

Score =21.0 bits (42), Expect = 63 
Identities = 5/5 (100%), Positives = 5/5 (100%) 

Query: 11 HLDDC 15 
HLDDC 

Sbjct: 3096 HLDDC 3100 



> qii 14118419|qb|AAE59658.1 I 



Length = 4655 



Sequence 86 from patent US 6187548 



4 



Score = 21.0 bits (42)., Expect = 63 
Identities = 5/5 (100%), Positives = 5/5 (100%) 

Query: 11 HLDDC 15 
HLDDC 

Sbjct: 3096 HLDDC 3100 



> qi I 14118420 I qb I AAE59659 . 1 1 Sequence 88 from patent US 6187548 
Length = 4655 

Score =21.0 bits (42), Expect = 63 
Identities = 5/5 (100%), Positives = 5/5 (100%) 

Query: 11 HLDDC 15 
HLDDC 

Sbjct: 3096 HLDDC 3100 



> qi i 15124 7 58 ! qb j AAE7 32 1 9 . 1 I Sequence 84 from patent US 6239270 
Length - 4654 

Score = 21.0 bits (42), Expect = 63 
Identities = 5/5 (100%), Positives = 5/5 (100%) 

Query: 11 HLDDC 15 
HLDDC 

Sbjct: 3096 HLDDC 3100 



> qi 114118418 1 qb | AAE5 9657 . 1 | Sequence 84 from patent US 6187548 
Length = 4655 

Score = 21.0 bits (42), Expect = 63 
Identities - 5/5 (100%), Positives = 5/5 (100%) 

Query: 11 HLDDC 15 
HLDDC 

Sbjct: 3096 HLDDC 3100 



> qi I 59508 02 | qb | AAE0 5Q28 . 1 | Sequence 2 from patent US 5869282 
qi I 1007 0027 | qb|AAE41587 . 1 | Sequence 2 from patent US 6004924 
Length = 1404 

Score - 20.6 bits (41), Expect - 84 
Identities - 5/8 (62%), Positives = 7/8 (87%) 

Query: 8 CXQHLDDC 15 

C ++LDDC 
Sbjct: 604 CAENLDDC 611 



> qi I 3209904 I qb| AAC20848 . 1 1 181607 Sequence 2 from patent US 5710019 
Length = 513 



5 



Score = 20.6 bits (41), Expect = 84 
Identities - 5/6 (83%), Positives = 5/6 (83%) 



Query: 11 
Sbjct: 160 



HLDDCC 16 
HLD CC 
HLDGCC 165 



> qi I 21438796 I emb | CAD3 4 7 85 . 1 | unnamed protein product [Homo sapiens] 
Length = 1105 

Score = 20.2 bits (40), Expect = 113 
Identities = 5/5 (100%), Positives = 5/5 (100%) 

Query: 10 QHLDD 14 
QHLDD 

Sbjct: 749 QHLDD 753 



> qi I 3 997 97 8 1 qb | AAC914 10 . 1 |AR017 52 0 
Length = 1865 

Score = 20.2 bits (40), Expect - 
Identities = 5/5 (100%), Positives 

Query: 10 QHLDD 14 
QHLDD 

Sbjct: 1510 QHLDD 1514 



Sequence 2 from patent US 5777094 
113 

= 5/5 (100%) 



> qi I 18 674 819 I emb I CAD2 32 95 . 1 1 unnamed protein product [Homo sapiens] 
Length = 816 

Score = 19.7 bits (39), Expect - 151 
Identities = 5/7 (71%), Positives = 6/7 (85%) 



Query: 8 CXQHLDD 14 

C QHLD+ 
Sbjct: 110 CPQHLDE 116 



> gi i 12824450 I qb I AAE4 935 6 . 1 | Sequence 1 from patent US 6110686 
Length = 178 

Score = 19.7 bits (39), Expect - 151 

Identities - 6/8 (75%), Positives = 6/8 (75%), Gaps = 2/8 (25%) 

Query: 10 QHL--DDC 15 

QHL DDC 
Sbjct: 125 QHLRLDDC 132 



> gi I 1512165 9 I qb 1 AAE72339 . 1 I Sequence 9 from patent US 6235708 
gi i 16219471 1 qb | AAE7412 5 . 1 I Sequence 9 from patent US 6245529 
Length = 145 

Score = 19.7 bits (39), Expect = -151 



6 



Identities = 6/8 (75%), Positives - 6/8 (75%), Gaps = 2/8 (25%) 

Query: 10 QHL--DDC 15 

QHL DDC 
Sbjct: 103 QHLRLDDC 110 



> gi I 21508815 I qb | AAM58270 . 1 | Sequence 1 from patent US 637 9925 
Length - 1964 

Score == 19.7 bits (39), Expect = 151 
Identities = 5/8 (62%), Positives = 7/8 (87%) 

Query: 8 CXQHLDDC 15 

C ++LDDC 
Sbjct: 349 CEENLDDC 356 



Score =18.5 bits (36), Expect = 365 
Identities - 5/8 (62%), Positives = 6/8 (75%) 

Query: 8 CXQHLDDC 15 

C Q LD+C 
Sbjct: 426 CHQDLDEC 433 



> gi I 1831307 I gb j AAB4 57 61 . 1 | Sequence 13 from patent US 5591821 
Length = 29 

Score = 19.3 bits (38), Expect - 203 
Identities - 5/8 (62%), Positives - 6/8 (75%) 

Query: 14 DCCSXXCN 21 

+CCS CN 
Sbjct: 17 NCCSGSCN 24 



> gi I 1254 37 2 I gbj AAA95 62 6. 1 | Sequence 6 from patent US 5482928 
gi I 3012065 i qb [ AAC11513 . 1 I 175 92 4 Sequence 6 from patent US 5689048 
Length - 25 

Score = 19.3 bits (38), Expect = 203 
Identities = 5/8 (62%), Positives - 6/8 (75%) 

Query: 14 DCCSXXCN 21 

DCC+ CN 
Sbjct: 14 DCCTGSCN 21 



> gi I 18 31303 j gbiAAB4 57 57 . 1 | Sequence 4 from patent US 5591821 
Length = 25 

Score = 19.3 bits (38), Expect = 203 
Identities - 5/8 (62%), Positives - 6/8 (75%) 

Query: 14 DCCSXXCN 21 



7 



+CCS CN 
Sbjct: 14 NCCSGSCN 21 



> qi I 512010 I emb | CAA014 42 . 1 1 ovine IL-1 beta [Ovis sp.] 
Length = 266 

Score = 19.3 bits (38), Expect = 203 
Identities = 5/6 (83%), Positives = 5/6 (83%) 



Query: 8 CXQHLD 13 

C QHLD 
Sbjct: 34 CTQHLD 39 



> qi I 1613525 I qb| AAB165 05 . 1 i Sequence 2 from patent US 5559095 

qi I 18304 59 I qb j AAB4 4 913 . 1 1 Sequence 2 from patent US 5587454 
qi I 4 00057 4 I qb | AAC94 006 . 1 | AR024 7 4 8 Sequence 2 from patent US 57 958 64 

qi 159425691 qb | AAE02101 . 1 | Sequence 2 from patent US 5859186 

qi I 5972562 i qb | AAE12453 . 1 I Sequence 2 from patent US 5824645 

qi I 722 3963 1 qb| AAE2413Q. 1 | Sequence 2 from patent US 5891849 

qi I 10054 011 I qb [ AAE30918 . 1 | Sequence 2 from patent US 5965534 

qi I 100 64 28 4 I qb | AAE38 708 . 1 | Sequence 2 from patent US 5994305 

qi I 12 815 62 6 I qb | AAE4 5 4 35 . 1 | Sequence 2 from patent US 6087091 

qi 1144778641 qb | AAE614 27 . 1 | Sequence 2 from patent US 6136786 
Length = 25 



Score - 19.3 bits (38), Expect = 203 
Identities = 5/8 (62%), Positives = 6/8 (75%) 



Query: 14 DCCSXXCN 21 

DCC+ CN 
Sbjct: 14 DCCTGSCN 21 



> gi 1210658161 emb j CAD32 04 6 . 1 | unnamed protein product [synthetic construct] 
Length = 2444 

Score =18.9 bits (37), Expect - 272 
Identities - 6/10 (60%), Positives = 7/10 (70%) 



Query: 12 LDDCCSXXCN 21 

LDDC S C+ 
Sbjct: 642 LDDCASSPCD 651 



Score = 16.8 bits (32), Expect = 1184 
Identities = 4/4 (100%), Positives = 4/4 (100%) 

Query: 10 QHLD 13 
QHLD 

Sbjct: 1847 QHLD 1850 



Score - 12.9 bits (23), Expect - 16694 
Identities = 4/6 (66%), Positives = 4/6 (66%) 



8 



Query: 16 
Sbjct: 1464 



CSXXCN 21 
CS CN 
CSLQCN 1469 



> qi i 14110683 I qb I AAE57223 . 1 j Sequence 6 from patent US 6171816 
qi I 15592507 I emb I CAC697 04 . 1 1 unnamed protein product [Homo sapiens] 
Length =166 

Score =18.9 bits (37), Expect = 272 
Identities = 4/6 (66%), Positives = 5/6 (83%) 

Query: 10 QHLDDC 15 

HL+DC 
Sbjct: 66 HHLEDC 71 



> qi I 4 52 6603 | emb! CAAQ04 57 . 1 | MAb MBrl VK sequence [synthetic construct] 
Length = 119 

Score =18.9 bits (37), Expect = 272 
Identities = 5/6 (83%), Positives = 5/6 (83%) 

Query: 8 CXQHLD 13 

C QHLD 
Sbjct: 98 CQQHLD 103 



> qi I 14103731 i qb [ AAE55165 . 1 \ Sequence 8 from patent US 6153740 
Length = 648 

Score =18.9 bits (37), Expect = 272 
Identities = 4/5 (80%), Positives = 5/5 (100%) 

Query: 12 LDDCC 16 
LD+CC 

Sbjct: 635 LDNCC 639 



> gi I 14103728 I qb j AAE55162 . 1 I Sequence 4 from patent US 6153740 
Length - 650 

Score =18.9 bits (37), Expect = 272 
Identities = 4/5 (80%), Positives = 5/5 (100%) 

Query: 12 LDDCC 16 
LD+CC 

Sbjct: 636 LDNCC 640 



> gi |1411068Hqbj AAE57 221 . 1 I Sequence 2 from patent US 6171816 
qi|15592501| emb I CAC697 03 . 1 j unnamed protein product [Homo sapiens] 

qi I 17 920557 I qb| AAE8 617 7 . 1 j Sequence 106 from patent US 6312922 
Length = 175 



9 



Score =18.9 bits (37), Expect = 272 
Identities = 4/6 (66%), Positives = 5/6 (83%) 

Query: 10 QHLDDC 15 

HLD4-C 
Sbjct: 76 HHLDEC 81 



> gi i 14110684 I qb| AAE57 22 4 . 1 | Sequence 7 from patent US 6171816 
Length = 183 

Score =18.9 bits (37), Expect = 272 
Identities = 4/6 (66%), Positives = 5/6 (83%) 

Query: 10 QHLDDC 15 

HL+DC 
Sbjct: 83 HHLEDC 88 



> gi I 12 813715 j gb j AAE4 5015 . 1 j Sequence 20 from patent US 6083904 
Length = 2556 

Score =18.9 bits (37), Expect = 272 
Identities = 6/10 (60%), Positives = 7/10 (70%) 

Query: 12 LDDCCSXXCN 21 

LDDC S C+ 
Sbjct: 642 LDDCASSPCD 651 



Score - 16.8 bits (32), Expect = 1184 
Identities = 4/4 (100%), Positives = 4/4 (100%) 

Query: 10 QHLD 13 
QHLD 

Sbjct: 1847 QHLD 1850 



Score = 12.9 bits (23), Expect = 16694 
Identities = 4/6 (66%), Positives = 4/6 (66%) 

Query: 16 CSXXCN 21 

CS CN 
Sbjct: 1464 CSLQCN 1469 



> gi I 141037 30 i gb | AAE5 5164 . 1 | Sequence 6 from patent US 6.153740 
Length - 64 9 

Score =18.9 bits (37), Expect = 272 
Identities = 4/5 (80%), Positives = 5/5 (100%) 



Query: 12 LDDCC 16 
LD+CC 

Sbjct: 635 LDNCC 639 



10 



> qi I 1410372 9 1 gb j AAE55163 . 1 ! Sequence 5 from patent US 6153740 
Length =64 9 

Score - 18.9 bits (37), Expect = 272 
Identities = 4/5 (80%), Positives - 5/5 (100%) 

Query: 12 LDDCC 16 
LD+CC 

Sbjct: 635 LDNCC 639 



> gi I 27 058 8 I gb | AAA02 097 . 1 | Sequence 2 from Patent US 4879374 
Length = 266 

Score =18.9 bits (37), Expect = 272 
Identities = 5/6 (83%), Positives = 5/6 (83%) 

Query: 8 CXQHLD 13 

C QHLD 
Sbjct: 34 CIQHLD 39 



> qi I 3994136 I qbj AAC87564 . H ARQ07930 Sequence 17 from patent US 5750652 
Length = 2556 

Score - 18.9 bits (37), Expect - 272 
Identities = 6/10 (60%), Positives = 7/10 (70%) 

Query: 12 LDDCCSXXCN 21 

LDDC S C+ 
Sbjct: 642 LDDCASSPCD 651 



Score - 16.8 bits (32), Expect = 1184 
Identities = 4/4 (100%), Positives - 4/4 (100%) 

Query: 10 QHLD 13 
QHLD 

Sbjct: 1847 QHLD 1850 



Score - 12.9 bits (23), Expect - 16694 
Identities = 4/6 (66%), Positives = 4/6 (66%) 

Query: 16 CSXXCN 21 

CS CN 
Sbjct: 1464 CSLQCN 1469 



> gi I 14110 685 i gb | AAE57 22 5 . 1 | Sequence 8 from patent US 6171816 
Length = 170 

Score - 18.9 bits (37), Expect = 272 
Identities - 4/6 (66%), Positives = 5/6 (83%) 



11 



Query: 10 QHLDDC 15 

HL+DC 
Sbjct: 70 HHLEDC 75 



21. 



SI 



£1 



18 304 86 I qb I AAB4 4 94 0. 1 | Sequence 2 9 from 
4 000 601 I qbl AAC94 033. 1 | AR024 77 5 Sequence 



5942596|qb|AAE02128.1| 



59725891 qbl AAE12480. 1 I 



7 223990 i gb| AAE24157 . 1 | 



10 0 64 338 I qb i AAE38 735 . 1 I 



12815 653 I qbl AAE4 5 4 62. 1 | 



14477891 |qb|AAE61454 . 1 | 



Length =26 



patent US 5587454 
29 from patent US 
Sequence 29 from patent US 5859186 
Sequence 29 from patent US 5824645 
Sequence 29 from patent US 5891849 
Sequence 29 from patent US 5994305 
Sequence 29 from patent US 6087091 
Sequence 29 from patent US 6136786 



5795864 



Score - 18.5 bits (36), Expect = 365 
Identities = 5/7 (71%), Positives = 5/7 



(71%) 



Query: 14 DCCSXXC 20 

DCCS C 
Sbjct: 14 DCCSGSC 20 



> qi I 141197 03 I gb | AAE60017 . 1 | Sequence 8 from patent US 6190876 

qi I 20220418 I gb I 7VAE8 68 66 . 1 | Sequence 8 from patent US 6319704 

gi 1215170111 qb |7\Aiy!604 25 . 1 | Sequence 8 from patent US 6399350 
Length =74 9 

Score =18.5 bits (36), Expect = 365 
Identities = 4/4 (100%), Positives = 4/4 (100%) 

Query: 13 DDCC 16 
DDCC 

Sbjct: 482 DDCC 485 



> gi I 3011710 I gb I AAC11158 .1117 55 69 Sequence 3 from patent US 5688764 
Length = 39 

Score =18.5 bits (36), Expect = 365 
Identities = 5/7 (71%), Positives = 5/7 (71%) 

Query: 14 DCCSXXC 20 

DCCS C 
Sbjct: 13 DCCSGSC 19 



161354 2 I gb| AAB16522 . 1 1 Sequence 19 from patent US 5559095 
183047 6 1 gb I AAB4 4 930 . 1 | Sequence 19 from patent US 5587454 
4 000591 1 qbl AAC94023. 1 1 AR024765 Sequence 19 from patent US 5795864 



5942586iqb|AAE02118.1| 
5972579|qb|AAE12470.1| 
7223980|qb!AAE24147.1| 
100 64 318 I qbj AAE3872 5 . 1 I 
12815643|qb|AAE45452.1| 
14477881 |gb!AAE61444 . 1 | 
Length =26 



Sequence 19 from patent US 5859186 
Sequence 19 from patent US 5824645 
Sequence 19 from patent US 5891849 
Sequence 19 from patent US 5994305 
Sequence 19 from patent US 6087091 
Sequence 19 from patent US 6136786 
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Score =18.5 bits (36), Expect = 365 
Identities = 5/7 (71%), Positives - 5/7 (71%) 

Query: 14 DCCSXXC 20 

DCCS C 
Sbjct: 14 DCCSGSC 20 



> qi I 1831302 I gb| AAB4 575 6. 1 1 Sequence 3 from patent US 5591821 
Length = 2 6 

Score =18.5 bits (36), Expect = 365 
Identities = 5/7 (71%), Positives - 5/7 (71%) 

Query: 14 DCCSXXC 20 

DCCS C 
Sbjct: 14 DCCSGSC 20 



> qi I 2725130 I qb| AAB92812 . 1 | 167 150 Sequence 21 from patent US 5670319 
qi I 320887 4 I qb| AAC2Q4 55. 1 | 18 058 4 Sequence 21 from patent US 5708142 
qi I 3 9918 52 I qb| AAC8527 9. 1 | AR002 8 03 Sequence 21 from patent US 5741667 
qi I 5 95137 8 I qb| AAE0 5 604 . 1 | Sequence 21 from patent US 5869612 
Length = 28 

Score = 18.5 bits (36), Expect = 365 
Identities = 4/5 (80%), Positives = 5/5 (100%) 

Query: 11 HLDDC 15 

HLD+C 
Sbjct: 24 HLDEC 2 8 



> gi I 4 774193 I emb | CAB42495 . 1 1 unnamed protein product [unidentified] 
Length = 2321 

Score - 18.5 bits (36), Expect = 365 
Identities = 5/6 (83%), Positives = 5/6 (83%) 

Query: 11 HLDDCC 16 

HLDD C 
Sbjct: 350 HLDDAC 355 



> gi i 4134 31 I emb | CAA007 67 . 1 | 0RF2 [synthetic construct] 
Length = 167 

Score =18.5 bits (36), Expect = 365 
Identities = 4/4 (100%), Positives = 4/4 (100%) 



Query: 13 DDCC 16 
DDCC 

Sbjct: 36 DDCC 39 



13 



> gi I 1004 9538 I gb | AAE2644 5 . 1 | Sequence 13 from patent US 5948900 
Length - 306 

Score = 18.5 bits (36), Expect = 365 
Identities = 5/6 (83%), Positives = 5/6 (83%) 

Query: 12 LDDCCS 17 

LDDC S 
Sbjct: 297 LDDCSS 302 



> gi I 128 07 911 |gb I AAE4 3131 . 1 1 Sequence 4 from patent US 6074840 
Length = 1253 

Score =18.5 bits (36), Expect - 365 
Identities = 4/4 (100%), Positives = 4/4 (100%) 

Query: 13 DDCC 16 
DDCC 

Sbjct: 1093 DDCC 1096 



> gi I 30117 09 I gb| AAC11157 . 1 | 17 5568 
Length = 39 

Score =18.5 bits (36), Expect = 
Identities - 5/7 (71%), Positives 

Query: 14 DCCSXXC 2 0 

DCCS C 
Sbjct: 13 DCCSGNC 19 



Sequence 2 from patent US 5688764 

365 
= 5/7 (71%) 



> gi {21065678 I emb | CAD32037 . 1 1 unnamed protein product [Homo sapiens] 
Length = 111 

Score =18.5 bits (36), Expect = 365 
Identities = 4/5 (80%), Positives = 5/5 (100%) 

Query: 13 DDCCS 17 

D+CCS 
Sbjct: 35 DECCS 39 



> gi I 202 4167 3 I gb | AAE92 34 4 . 1 | Sequence 338 from patent US 634 8328 
Length = 243 

Score =18.5 bits (36), Expect = 365 
Identities = 4/4 (100%), Positives = 4/4 (100%) 

Query: 13 DDCC 16 
DDCC 

Sbjct: 240 DDCC 243 



> gi I 3994135 I gb | AAC8 7 5 63 . 1 | AR0 07 92 9 ' Sequence 16 from patent US 5750652 
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gi I 12813714 I gb | AAE4 5014 . 1 | Sequence 19 from patent US 6083904 
Length = 2471 

Score - 18.5 bits (36), Expect 365 
Identities - 5/6 (83%), Positives - 5/6 (83%) 

Query: 11 HLDDCC 16 

HLDD C 
Sbjct: 37 4 HLDDAC 37 9 



Score =18.5 bits (36), Expect = 365 
Identities = 6/13 (46%), Positives = 8/13 (61%) 

Query: 8 CXQHLDDCCSXXC 20 

C + LD+C S C 
Sbjct: 1146 CEEQLDECASNPC 1158 



> gi 118674249 1 emb I CAD232 72 . 1 j unnamed protein product [Homo sapiens] 
gi {21522704 I emb \ CAD354 06 . 1 j unnamed protein product [Homo sapiens] 
Length = 2470 

Score = 18.5 bits (36), Expect = 365 
Identities = 5/6 (83%), Positives = 5/6 (83%) 

Query: 11 HLDDCC 16 

HLDD C 
Sbjct: 373 HLDDAC 378 



Score =18.5 bits (36), Expect = 365 
Identities - 6/13 (46%), Positives - 8/13 (61%) 

Query: 8 CXQHLDDCCSXXC 20 

C + LD+C S C 
Sbjct: 1145 CEEQLDECASNPC 1157 



> gi I 3 994138 I gb | AAC8 7566. 1 | AR0 07 932 Sequence 19 from patent US 5750652 
Length = 2703 

Score - 18.5 bits (36), Expect = 365 
Identities = 5/6 (83%), Positives = 5/6 (83%) 

Query: 11 HLDDCC 16 

HLDD C 
Sbjct: 408 HLDDAC 413 



Score =14.6 bits (27), Expect = 5150 
Identities = 3/4 (75%), Positives = 4/4 (100%) 

Query: 12 LDDC 15 
LD+C 

Sbjct: 1416 LDEC 1419 
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Score =12.9 bits (23), Expect = 16694 
Identities = 4/9 (44%), Positives = 6/9 (66%) 

Query: 13 DDCCSXXCN 21 

++C S CN 
Sbjct: 67 9 NECHSNPCN 687 



> qi j 30117 08 I gb | AAC1115 6 . 1 II755 67 Sequence 1 from patent US 5688764 
Length = 39 

Score =18.5 bits (36), Expect = 365 
Identities = 5/7 (71%), Positives = 5/7 (71%) 

Query: 14 DCCSXXC 20 

DCCS C 
Sbjct: 13 DCCSGNC 19 



> gi I 14 097 4 64 I gb j AAE5230 6 . 1 | Sequence 5 from patent US 6133232 
Length = 979 

Score =18.5 bits (36), Expect = 365 
Identities = 4/5 (80%), Positives = 5/5 (100%) 

Query: 11 HLDDC 15 
HLD+C 

Sbjct: 798 HLDNC 802 



Get selected sequences 
Select all 
Deselect all 



Database: Protein sequences derived from the Patent division of 
GenBank 

Posted date: Nov 25, 2002 1:06 AM 
Number of letters in database: 16,453,681 
Number of sequences in database: 104,350 

Lambda K H 

0.357 0.284 2.14 

Gapped 

Lambda K H 

0.294 0.110 0.610 



Matrix: PAM30 

Gap Penalties: Existence: 9, Extension: 1 
Number of Hits to DB: 379,047 
Number of Sequences: 104350 
Number of extensions: 3808 
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Number of successful extensions: 641 

Number of sequences better than 20000.0: 623 

Number of HSP's better than 20000.0 without gapping: 623 

Number of HSP's successfully gapped in prelim test: 0 

Number of HSP's that attempted gapping in prelim test: 0 

Number of HSP's gapped (non-prelim): 641 

length of query: 27 

length of database: 16,453,681 

effective HSP length: 18 

effective length of query: 9 

effective length of database: 14,575,381 

effective search space: 131178429 

effective search space used: 131178429 

T: 11 

A: 40 

XI: 14 ( 7.2 bits) 
X2: 35 (14.8 bits) 
X3: 58 (24 . 6 bits) 
SI: 23 (13.7 bits) 
S2: 23 (12.9 bits) 



Search for : 

CHIXNQHCWQHLDDCCSHHCNHWNHCV 

Chose to make the most divergent peptide (still within scope of generic) to test what would match. 



BLASTP 2.2.4 f Aug-26-2002] 



Reference : 

Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search 
programs". Nucleic Acids Res. 25:338 9-34 02. 



RID: 1038288569-017646-1278 



Query= 

(27 letters) 



Database: All non-redundant GenBank CDS 
translations+PDB+SwissProt+PIR+PRF 

1,242,768 sequences; 395,571,179 total letters 



If you have any problems or questions with the results of this search 
please refer to the BLAST FAQs 



Taxonomy reports 



Distribution of 157 Blast Hits on the Query Sequence 



Sequences producing significant alignments: 


Score 
(bits) 


E 

Value 




3891872 Ipdbl IKCPI 3d Structure Of K-Conotoxin Pviia, A ... 


52 


6e-07 


gi 


7519866 Ipir 1 IA58997 kappa-conotoxin PVIIA - cone shell (.., 


52 


6e-07 


gi 
qi 


3891964 Ipdbl 1AV3 i Potassium Channel Blocker Kappa Conot , . , 
22002036 1 spl P56633 i CXK7 CONPU Kappa-conotoxin PVIIA prec. . 


52 
52 


6e-07 
6e-07 


gi 


6907097 1 dbj 1 BAA90624 . 1 1 Similar to Arabidopsis thaliana .. 


29 


4.5 


gi 


21402496 1 ref 1 NP 658481.11 minC, Bacterial proteins invol . . 


28 


15 


gi 


13376410 1 reflNP 079212.1] hypothetical protein FLJ23024 


27 


26 


gi 


1085397 Ipir 1 1 S46487 taurine transporter - human >gi|5598.. 


27 


35 


gi 


1352535 1 spl P31641 1 S6A6 HUMAN Sodium- and chloride-depend.. 


27 


35 


gi 


13560627|gb|AAK30132.1|AF346763 1 intestinal taurine tra.. 


27 


35 



2 





/ o4o Zo \ go \ hAAl o / b o . 1 1 nypo rne l icai prorexn [ fiasinoaiuiu . . . 


9 7 
Z / 


O 0 


gi 


4oU/U4b|reriiNF uuJU-34i.i| soiure earner raiuiiy d ( neuro l . . . 


9 7 
Z / 


JO 


gi 


X / Uood4 I spl tr ID 1 \ iiKri L.riiLi\ LiOw— usns 1 tiy iipopiTO rem rec . . . 


9 7 
Z / 


o 0 




UbzJJUo 1 go 1 AAHu Dz: . i [ AAriu oz oz unKnown tproLein ror lyiluL, . . . 


9 7 
Z / 


O 0 




zllUa 1 oD 1 gJo 1 AAM J 4 / yu.iiAhiDizoou z nAu4 [ AraDioops IS ma J. . . . 


9 

Z D 


/I 7 




dU i U b 5 4 i go 1 AAe UizU/.i|ArioUozo 1 Ji.— septiin snort, rorin [ Ka . . . 


9 

Z D 


A 7 


gi 


lo4L)obzz!rer|Nr oD4/Ub.i| uKbr}— Dinaing prorem, purative . , . 


z o 


A 7 
^ / 


gi 


1 4 / 94 y bb 1 gu 1 AAK / Jo 1 y . 1 1 HA04 [AraDiciopsis tnaiianaj 


9 

Z D 


4 / 


qi 


13 92 9z 00 i rer 1 NP li4Uzo.i| b-septm [Kattus norvegicusj >... 


9 

Z D 


/I ^7 

4 / 


gi 


20093168 i ref 1 NP 619243. 1| nypotnetical protein (multi-ao... 


9 

Z O 


/l 9 

4 / 


gi 


60 908 0 1 1 gb 1 AAF033 / b . 1 1 Ar 1 / Uz 03 1 septm-liKe protein [Ka... 


9 

Z D 


A 9 

4 / 


gi 


6683817 1 gb 1 AAF23374 . 1 1 MLL septin-like rusion protein Mb... 


9 

Z. b 


4 / 


qi 


17 5 60 900 i ref 1 NP o0b2b4.1| Freaictea CUb, major sperm pro... 


Z D 


/ 9 

4 / 


qi 


8778322 1 gb 1 AAF / 9331 . 1 1 AC0Uz3U4 Z4 t'14Jlb,z/ [AraDidopsis . . . 


9 

Z D 


/I 9 

4 / 


gi 


0 / 2 y 9 33 1 ret 1 NP UUbb3i.l| IMLL septm-iiKe rusion, N-L-L sep... 


9 

Z D 


/I 9 
4 / 


gi 


24bob402 jretlNr blloUZ.Zl LbiU4 oo-rA [ urosopniia meianog . . . 


9 

Z O 


A 9 


gi 


220bb3bo i ret 1 XP ll3oyz.Z| similar to mLiLi septin-iiKe rus , , . 


9 

z b 


A 9 
4 / 


gi 


14 o3010 / I emo 1 CAC4 zzzz . 1 | UVAKIAN/ breast sep tin aeita [ho . . . 


9 

Z D 


/I 9 
4 / 


gi 


b 0 0 b 3 o o 1 go 1 AAr u z o 4 y . 1 } ALU u y o y 4 z u un Known protein [ Araoi . . . 


9 

Z D 


/I 7 
4 / 


gi 


10433911 1 qd;] | BAB 14 Uo / . 1 | unnameu protein proauct [Homo s . . . 


9 

Z D 


/I 9 
4 / 


gi 


o393/o4|re±[NP Uoyu/b.l| sept m y ; JxiLL sept in— xiKe rusio . . . 


9 CZ. 
Z O 


A 9 
4 / 


gi 


14530105 1 emb 1 CAC42221 . 1 1 OVARIAN/ Breast septm gamma [Ho... 


9 

Z b 


A 9 
4 / 


gi 


7512587 Ipirl |Tlzol9 nypotnetical protein UKr Zip4 34b Z4o . 1 ... 


9 

Z D 


/I 9 
4 / 


gi 


4 58 9 62 6 1 dbj 1 BAA / bo 3o . 1 i KlAAUyyi protein [homo sapiensj ... 


9 

Z D 


/I 9 
4 / 


gi 


18582088 1 ref j XP 09061 /.Ij similar to Adenylate cyclase, ... 


Z b 


4 / 


gi 


bU 9 0 9 1 1 1 go 1 AAi U3 3 y 1 . 1 1 Ah 1 / oo y y 1 septm-iiKe protein biir,.. 


9 

Z D 


A 9 
4 / 


gi 


2 0 8 925 33 1 ref 1 XP lobl44.1| nypotnetical protein aP lobi44.,. 


9 £^ 

Z b 


A 9 

4 / 


gi 


23UooO /O 1 go i ZP UUUo3i/o.l| nypotnetical protein [rseuaom... 


9 

z o 


9 

bo 


gi 


24 4 59580 1 gb j AAN3438 1 . 1 1 envelope glycoprotein [Human imm, . . 


9 

Z D 


9 
UO 


gi 


2 0 D 4 4 4 9 4 1 re t 1 XP 13/yoZ.l| similar to putative retrovirus... 


9 

Z b 


CZ. 9 
b J 


gi 


yb2yb32|rei|NP U44yi/-ll tegument protein / r taAKAi [mur la . . . 


9 

Z D 


9 
DO 


gi 


z32 /2oo2 1 gb 1 AAH300 / 4 . 1 1 cnromosome o open reaamg rrame ... 


9 

z o 


9 
DO 


gi 


1 3 4 4 92 8 / 1 re t 1 NP Uooloo.il Nbo protein [Homo sapiensj >gi... 


z b 


£^ 9 

bo 


gi 


1 692 4 037 1 gb 1 AAL3 1 64 9 . 1 i AC07 91 / 9 4 Putative o-metnyltran. . . 


Z 0 


o c: 
O 0 


gi 


23472065 1 gb 1 ZP 00127393.11 hypothetical protein [Pseudom. . . 


2o 


o c 
O 0 


gi 


2 0 908 8 1 1 1 ref 1 XP 138742.11 similar to endogenous retrovir. . . 


9 R 

Zo 


Q c; 
o O 


gi 


155 9837 5 1 ref 1 NP 251869.11 conserved nypotnetical protein... 


2o 


o c 

8 0 


gi 


23058 698 [ gb 1 ZP 00083733.11 hypothetical protein [Pseudom... 


ZO 


o c 
O O 


gi 


204 688 66 1 ref 1 XP 117141.11 hypothetical protein XP 11/141... 


25 


o c 

8 5 


gi 


159834 10 1 gb ! AALl 1573 . 1 1 AF4 24 57 9 1 AT4gl65 /0/dl431Uw [Ara... 


25 


114 


gi 


22094360 i gb j AAM91887 . 1 1 putative cytokmin oxidase [Oryz... 


Z O 


11/1 
114 


gi 


202106 1 gb 1 7VAA404 68 . 1 i transition protein 2 (alt.) 


o c; 
Z O 


11/1 
114 


gi 


110055 1 pir M S14529 transition protein 2 - mouse >gi|o48/... 


ZO 


11/1 
114 


gi 


18 4 14 696 j ref 1 NP 567508.11 expressed protein; protein id:... 


Z 0 


11/1 
114 


gi 


2 13022 70 1 gb 1 EAAl 4 4 1 5 . 1 1 agCPobUo [Anopheles gamoiae str.... 


Z O 


1 1 /J 
114 


gi 


24 8 17521 j emb 1 CAD54 14 7 . 2 1 Hypothetical protein ZK757.4 [C... 


2o 


114 


gi 


202105 1 gb 1 AAA404 67 . 1 1 transition protein 2 (alt.) >gi|12... 


25 


114 


gi 


17 557 07 1 1 ref i NP 499189.11 DHHC zinc binding domain conta... 


25 


11/1 
114 


gi 


12 0835 91 i ref 1 NP 038722.2 [ transition protein 2 [Mus muse... 


Z 0 


11/1 
114 


gi 


24 64 6280 1 ref 1 NP 650191.11 CG5196-PA [Drosophila melanoga... 


o c 
25 


11^ 
114 


gi 


15836695 i ref 1 NP 29/383.1 | tRNA delta ( z ) -isopent enylpyrop .. . 


9 R 

Z 0 


114 


gi 


20151945 ! gb 1 AAM11332 . 1 1 GH0b/59p [Drosopnila melanogaster J 


Z 0 


11/1 
114 


gi 


1 65 1 9 5 3 9 ! ret 1 NP 443 /3 /.ii low density lipoprotem-reiate . . . 


9 c; 
Z O 


1 1 j1 

114 


gi 


z3Uoo34 3 igoj Lr uuuoi4oJ. 1 | nypotnetical protein [beooact . . . 


9 R 
Z O 


11/1 
114 


gi 


IOj'O / Zi?! jgJOj r-l-rii\0 U 1 Z Z . ± | rn.^ \J O i 1 Z) i I ^JUL-dLJ-Vt:; ^lHJo^].iUJ.J_^cioC... 




X X 4 


gi 


930024 1 emb ICAA30824 . 1 1 tenascin [Gallus gallus] 


24 


153 


gi 


135584 i sp 1 P10039 1 TENA CHICK Tenascin precursor (TN) (Hex... 


24 


153 


gi 


15230656 1 ref !NP 187904.11 putative CREB-binding protein;... 


24 


153 


gi 


212748 |qbiA7=LA49085. 1 1 190 kd tenascin precursor 


24 


153 


gi 


18546360 IreflXP 099257.1] hypothetical protein XP 099257... 


24 


153 


gi 


212747 1 gb|AAA49084 . 1 i 200 kd tenascin precursor 


24 


153 
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9i 


t 21105780 1 gb 1 AAM34788 . 1 IAF512558 2 HAC5 [Arabidopsis thai... 


24 


153 


gi 


1 15341664 1 gb 1 AAH06778 . 1 1 Unknown (protein for IMAGE : 358 90 .. . 


24 


205 


gi 


1 2507193 1 sp 1 P27704 i MK06 RAT Mitogen-activated protein kin... 


24 


205 


gi 


1 13928856 i ref 1 NP 113810.1] mitogen-activated protein kina . . . 


24 


205 


gi 


1 24583179 1 ref 1 NP 609327.21 CG31712-PA [Drosophila melanog... 


24 


205 


gi 


2204 9606 1 ref 1 XP 070277.21 similar to Otoconin 90 precurs . . . 


24 


205 


gi 


235 92724 j ref 1 XP 128141.2 1 otoconin 90 [Mus musculus] 


24 


205 


gi 


4506139 1 ref 1 NP 002759.11 procollagen (type III) N-endope... 


24 


205 


gi 


22996345 1 gb 1 ZP 00040604.11 hypothetical protein [Xylella... 


24 


205 


gi 


22993598 i gb 1 ZP 00038168.11 hypothetical protein [Xylella... 


24 


205 


gi 


13509205 1 emb 1 CAC35209 . 1 1 GRAAL2 protein [Drosophila mela... 


24 


205 


gi 


6682303 1 emb 1 CAB64 653 , 1 1 GRAAL protein [Drosophila melano... 


24 


205 


gi 


7488985 1 pir 1 1 T174 62 disease resistance E - tomato >gi|42... 


24 


205 


qi 


15292455 1 gb|AAK93496. 1 [ SD02860p [Drosophila melanoqaster ] 


24 


205 


gi 


18593306 1 ref 1 XP 104434.11 hypothetical protein XP 104434... 


24 


205 


gi 


21430770 i gb 1 AAM51063 . 1 1 SD13780p [Drosophila melanogaster ] 


24 


205 


gi 


4092677 1 gb|AAC99455. 1 1 otoconin-90 [Mus musculus] 


24 


205 


gi 


174 64262 1 ref 1 XP 069418.11 hypothetical protein XP 069418... 


24 


205 


gi 


16416462 1 dbj 1 BAB70658 . 1 1 spondin [Ciona savignyi] 


24 


205 


gi 


4176764 1 gb IAAD08924 . 1 j otoconin-95 precursor [Mus musculus] 


24 


205 


gi 


1676264 6 1 ref 1 NP 458263.11 alpha-amylase [Salmonella ente... 


24 


205 


gi 


2323962 1 i gb j AAH354 92 . 1 i similar to mitogen-activated pro... 


24 


205 


gi 


looooyio 1 re riwr D/iyjb.ii ]]aggeai [iJanio rerio j >gi ] z u 4 o , , , 


Z 4 


n r\ c 

205 


gi 


167669491 ref |NP 462564.11 alpha-amylase [Salmonella typh... 


24 


205 


gi 


4506091 1 ref 1 NP 002739.11 mitogen-activated protein kinas . . . 


24 


205 


gi 


6225792 1 sp 1 Q02509 1 OC90 HUMAN Otoconin 90 precursor {Oc90... 


24 


205 


gi 


559705 1 dbj IBAA07557. 11 The hal551 gene is novel. [Homo s... 


24 


205 


gil 


24661359iref INP 648288.1! CG4821-PA [ Drosophila melanoqa .. . 


24 


205 


gi| 


59977 1 emb 1 CAA78662 . 1 1 tripartite fusion transcript PLA2L. . . 


24 


205 



Aligrxnients 

Get selected sequences 
Select all 
Deselect all 



> gii3891872|pdbilKCP| 3d Structure Of K-Conotoxin Pviia, A Novel Potassium 
Channel-Blocking Toxin From Cone Snails, Nmr, 22 
Structures 
Length - 28 

Score =52.4 bits (116), Expect - 6e-07 
Identities = 19/27 (70%), Positives = 21/27 (77%) 

Query: 1 CHIXNQHCWQHLDDCCSHHCNHWNHCV 27 

C I NQ C+QHLDDCCS CN +N CV 
Sbjct: 1 CRIXNQKCFQHLDDCCSRKCNRFNKCV 27 



> gi I 7519866 I pir I iA58997 kappa-conotoxin PVIIA - cone shell (Conus purpurascens ) 
Length = 27 

Score =52.4 bits (116), Expect = 6e-07 
Identities = 18/27 (66%), Positives = 20/27 (74%) 

Query: 1 CHIXNQHCWQHLDDCCSHHCNHWNHCV 27 
C I NQ C+QHLDDCCS CN +N CV 



4 



Sbjct: 1 CRIPNQKCFQHLDDCCSRKCNRFNKCV 27 



> gi|3891964 | pdb | 1AV3 I Potassium Channel Blocker Kappa Conotoxin Pviia From C. 
Purpurascens, Nmr, 20 Structures 
Length - 27 

Score - 52.4 bits (116), Expect = 6e-07 
Identities - 19/27 (70%), Positives = 21/27 (77%) 

Query: 1 CHIXNQHCWQHLDDCCSHHCNHWNHCV 27 

C I NQ C+QHLDDCCS CN +N CV 
Sbjct: 1 CRIXNQKCFQHLDDCCSRKCNRFNKCV 27 



> gi I 22002036 [ sp | P56633 | CXK7 CONPU Kappa-conotoxin PVIIA precursor (Fin-popping peptide) 
Length = 72 

Score = 52.4 bits (116), Expect = 6e-07 
Identities = 18/27 (66%), Positives = 20/27 (74%) 

Query: 1 CHIXNQHCWQHLDDCCSHHCNHWNHCV 27 

C I NQ C+QHLDDCCS . CN +N CV 
Sbjct: 4 6 CRIPNQKCFQHLDDCCSRKCNRFNKCV 72 



> gi|6907097|dbj IBAA90624.H Similar to Arabidopsis thaliana DNA chromosome 4, ESSA I 
contig 

fragment No. 6; calcium channel protein alpha-1 chain 
isoform A - rat. (Z97341) [Oryza sativa (japonica 
cultivar-group) ] 
Length =58 9 

Score = 29.5 bits (62), Expect = 4.5 

Identities = 8/15 (53%), Positives = 9/15 (60%), Gaps = 4/15 (26%) 

Query: 17 SHH CNHWNHCV 27 

SHH C+HW CV 
Sbjct: 292 SHHMKDWCDHWKQCV 306 



> gi|21402496|ref |NP 658481. 1[ minC, Bacterial proteins involved in chromosomal 
partioning 

[Bacillus anthracis A2012] 
Length = 228 

Score =27.8 bits (58), Expect = 15 
Identities = 7/7 (100%), Positives = 7/7 (100%) 

Query: 11 HLDDCCS 17 

HLDDCCS 
Sbjct: 21 HLDDCCS 27 



> gi| 13376410|ref |NP Q79212.il hypothetical protein FLJ23024 [Homo sapiens] 
gij 104 39583 jdbj I BAB15524.il unnamed protein product [Homo sapiens] 
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Length = 279 
Score - 26.9 bits (56), Expect = 26 

Identities - 11/30 (36%), Positives = 11/30 (36%), Gaps - 17/30 (56%) 

Query: 5 NQHCWQHLDDCCSHHCNH WNHC 2 6 

NQHC HCN WNHC 

Sbjct: 157 NQHC EHCNSCTSKDGRKWNHC 177 



> giil085397|pir| IS46487 taurine transporter - human 
gi I 55 9853 I qb | AAA5Q842 . 1 | placental taurine transporter 
Length - 620 

Score = 26.5 bits (55), Expect = 35 

Identities = 8/11 (72%), Positives = 8/11 (72%), Gaps = 3/11 (27%) 

Query: 19 HCNH-WN--HC 26 

HCNH WN HC 
Sbjct: 161 HCNHSWNTPHC 171 



> gi I 1352535 | sp | P31641 | S6A6 HUMAN . Sodium- and chloride-dependent taurine transporter 
gi I 7441660 I pir I IG01426 taurine transporter - human 
gi I 7 99339 | gb | AAC5Q4 4 3 . 1 j taurine transporter 
Length = 620 

Score = 26.5 bits (55), Expect = 35 

Identities = 8/11 (72%), Positives = 8/11 (72%), Gaps = 3/11 (27%) 

Query: 19 HCNH-WN--HC 26 

HCNH WN HC 
Sbjct: 161 HCNHSWNTPHC 171 



> gi[ 13560627 I qbj AAK30132.il AF34 67 63 1 intestinal taurine transporter [Homo sapiens] 
Length = 244 

Score = 26.5 bits (55), Expect = 35 

Identities = 8/11 (72%), Positives = 8/11 (72%), Gaps - 3/11 (27%) 

Query: 19 HCNH-WN — HC 26 

HCNH WN HC 
Sbjct: 81 HCNHSWNTPHC 91 



> gi|234 82 928 |qb|EAA18 7 66.1 I hypothetical protein [Plasmodium yoelii yoelii] 
Length = 456 

Score = 26.5 bits (55), Expect = 35 

Identities = 11/24 (45%), Positives = 11/24 (45%), Gaps = 12/24 (50%) 

Query: 6 QHCWQHLDDCCSHHC-N--HWNHC 26 

QH WQH C N HW HC 

Sbjct: 18 8 QH-WQH CQNFQHWQHC 202 



6 



> gi |4507Q45|ref|NP 003034. 1[ solute carrier family 6 (neurotransmitter transporter, 
taurine ) , 

member 6; Solute carrier family 6 (neurotransmitter 
transporter, taurine) , [Homo sapiens] 
gi I 34 6388 I pir I IS29839 taurine transport protein - human 
gi I 3 6727 I emb j CAA7 94 81 . 1 | taurine transporter [Homo sapiens] 

gi I 2 66092 I gb [ 7VAB25509 . 1 | taurine transporter, HTAU [human, thyroid cells. Peptide, 619 
aa] 

Length - 619 
Score = 26.5 bits (55), Expect = 35 

Identities - 8/11 (72%), Positives = 8/11 (72%), Gaps = 3/11 (27%) 

Query: 19 HCNH-WN— HC 26 

HCNH WN HC 
Sbjct: 161 HCNHSWNTPHC 171 



> gi il708864|sp|P98157| LRPl CHICK Low-density lipoprotein receptor-related protein 1 
precursor (LRP) 

(Alpha-2-macroglobulin receptor) (A2MR) 
gi I 107 94 16 I pir I IA53102 alpha-2-macroglobulin receptor precursor - chicken 
gi I 438007 | emb | CAA52870 . 1 1 alpha-2-macroglobulin receptor [Callus gallus] 
Length = 4543 

Score - 26.5 bits (55), Expect = 35 

Identities = 8/18 (44%), Positives = 10/18 (55%), Gaps - 4/18 (22%) 

Query: 7 HCWQHLDDC CSHHC 20 

HC + L +C C HHC 
Sbjct: 109 HCREQLANCTALGCQHHC 12 6 



> gi I 13 623303 I qb | AAH0 62 52 . 1 | AAH0 6252 Unknown (protein for MGC: 10619) [Homo sapiens] 
Length - 200 

Score = 26.5 bits (55), Expect = 35 

Identities = 8/11 (72%), Positives - 8/11 (72%), Gaps - 3/11 (27%) 

Query: 19 HCNH-WN--HC 26 

HCNH WN HC 
Sbjct: 161 HCNHSWNTPHC 171 



> gi| 21105785 | gb | AAM3 4 7 90 . 1 jAF512 5 6Q 2 HAC4 [Arabidopsis thaliana] 
Length = 385 

Score = 26.1 bits (54), Expect - 47 

Identities = 11/25 (44%), Positives = 11/25 (44%), Gaps = 10/25 (40%) 

Query: 11 HLDDCCSHHC NHW--NHC 26 

HL CC HC N W NHC 

Sbjct: 90 HLQHCCK-HCTTLMVSGNRWVCNHC 113 



> gi I 6010 654 I gb | AAF012Q7 . 1 1 AF18052 6 1 E-septin short form [Rattus norvegicus] 
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Length = 334 

Score = 26.1 bits (54), Expect = 47 
Identities = 6/9 (66%), Positives = 8/9 (88%) 

Query: 2 HIXNQHCWQ 10 

HI N++CWQ 
Sbjct: 118 HINNENCWQ 126 



> gi|18405622|ref |NP 564706.1! CREB-binding protein, putative; protein id: Atlg55970.1, 
supported by 

cDNA: gi_14794965 [Arabidopsis thaliana] 
Length = 1456 

Score = 26.1 bits (54), Expect = 47 

Identities - 11/25 (44%), Positives - 11/25 (44%), Gaps = 10/25 (40%) 

Query: 11 HLDDCCSHHC NHW— NHC 26 

HL CC HC N W NHC 

Sbjct: 1161 HLQHCCK-HCTTLMVSGNRWVCNHC 1184 



> gi I 14 7 94 966 I gb I AAK7 3519 . 1 | HAC4 [Arabidopsis thaliana] 
Length =413 

Score = 26.1 bits (54), Expect - 47 

Identities = 11/25 (44%), Positives = 11/25 (44%), Gaps = 10/25 (40%) 

Query: 11 HLDDCCSHHC NHW--NHC 26 

HL CC HC N W NHC 

Sbjct: 118 HLQHCCK-HCTTLMVSGNRWVCNHC 141 



> gil 13929200 IrefjNP 114025.11 E-septin [Rattus norvegicus] 
gi| 6010 652|qb|AAF012 0 6.11AF180525 1 E-septin long form [Rattus norvegicus] 
Length = 405 

Score = 26.1 bits (54), Expect = 47 
Identities = 6/9 (66%), Positives = 8/9 (88%) 

Query: 2 HIXNQHCWQ 10 

HI N++CWQ 
Sbjct: 189 HINNENCWQ 197 



> gi|20093168iref |NP 619243.11 hypothetical protein (multi-domain) [Methanosarcina 
acetivorans 

str. C2A] 

gi|19918512i qb| AAM07 723. 1 j hypothetical protein (multi-domain) [Methanosarcina 
acetivorans 

str. C2A] 
Length = 489 

Score = 26.1 bits (54), Expect = 47 
Identities - 8/14 (57%), Positives = 9/14 (64%) 
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Query: 14 DCCSHHCNHWNHCV 27 

DCC H C+H H V 
Sbjct: 4 63 DCCCHECDHHKHYV 47 6 



> gi I 6090881 I gb I AAF0337 6.il AF170253 1 septin-like protein [Rattus norvegicus] 
Length = 564 

Score - 26.1 bits (54), Expect = 47 
Identities = 6/9 (66%), Positives - 8/9 (88%) 

Query: 2 HIXNQHCWQ 10 

HI N++CWQ 
Sbjct: 34 9 HINNENCWQ 357 



> gi i 668 3817 I gb | AAF2337 4 . 1 { MLL septin-like fusion protein MSF-A [Homo sapiens] 
Length = 586 

Score = 26.1 bits (54), Expect = 47 
Identities - 6/9 (66%), Positives = 8/9 (88%) 

Query: 2 HIXNQHCWQ 10 

HI N++CWQ 
Sbjct: 369 HINNENCWQ 377 



> gi I 17560900 I ref I NP 506264.11 Predicted CDS, major sperm protein family member family 
member 

[Caenorhabditis elegans] 
gi I 7503252 Ipir I IT22096 hypothetical protein F42E8.2 - Caenorhabditis elegans 
gi [ 3877 03 9 I emb [ CAB014 29. 1 | Hypothetical protein F42E8.2 [Caenorhabditis elegans] 
Length = 122 

Score - 26.1 bits (54), Expect - 47 

Identities - 8/12 (66%), Positives = 8/12 (66%), Gaps - 2/12 (16%) 

Query: 13 DDCCSHHCNHWN 24 

D CCSH C WN 
Sbjct: 96 DACCSHSC--WN 105 



> gi| 8 778322 |gb[AAF7 9331. 1 IAC0023Q4 24 F14J16.27 [Arabidopsis thaliana] 
Length = 1550 

Score = 26.1 bits (54), Expect - 47 

Identities = 11/25 (44%), Positives = 11/25 (44%), Gaps - 10/25 (40%) 

Query: 11 HLDDCCSHHC NHW--NHC 2 6 

HL CC HC N W NHC 

Sbjct: 122 9 HLQHCCK-HCTTLMVSGNRWVCNHC 1252 



> gi|5729933|ref |NP 006631. 1| MLL septin-like fusion; MLL septin-like fusion (NOTE: n. 
standard 

symbol and name); septin Dl [Homo sapiens] 
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gi| 5106557 |qb|AAD3974 9. 1IAF123Q52 1 MLL septin-like fusion protein [Homo sapiens] 
gi[ 11055011 igb I AAG27 919. 1 [AF14 2408 1 cell division control protein septin Dl [Homo 
sapiens ] 

gi I 14 530109 |emb|CAC4 2223. 1 | OVARIAN/Breast septin alpha [Homo sapiens] 
gi| 18203688 |gb|AAH21192. 1 IAAH21192 MLL septin-like fusion [Homo sapiens] 
Length - 568 

Score - 26.1 bits (54), Expect = 47 
Identities = 6/9 (66%), Positives - 8/9 (88%) 

Query: 2 HIXNQHCWQ 10 

HI N++CWQ 
Sbjct: 351 HINNENCWQ 359 



> gi|24656402|ref |NP 611502.21 CG10438-PA [Drosophila melanogaster ] 
gii24656406[ref |NP 725988.11 CG30144-PA [Drosophila melanogaster] 
gi|73Q2348|gb|AAF57437.1| CG10438-PA [Drosophila melanogaster] 
gi|2162 68 95 |gb|AAM683 96. 1 | CG30144-PA [Drosophila melanogaster] 
Length = 108 

Score - 26.1 bits (54), Expect = 47 

Identities = 7/10 (70%), Positives = 8/10 (80%), Gaps = 1/10 (10%) 

Query: 15 CCSHHCNH-W 23 

CC+H CNH W 
Sbjct: 97 CCNHECNHTW 106 



> gi|22066365|ref [XP 113892.2) similar to MLL septin-like fusion; MLL septin-like fusion 
(NOTE: 

non-standard symbol and name); septin Dl [Homo sapiens] 
Length = 569 

Score = 26.1 bits (54), Expect = 47 
Identities = 6/9 (66%), Positives = 8/9 (88%) 

Query: 2 HIXNQHCWQ 10 

HI N++CWQ 
Sbjct: 352 HINNENCWQ 360 



> gii 14 530107 | emb | CAC4 2222 . 1 | OVARIAN/Breast septin delta [Homo sapiens] 
Length = 335 

Score = 26.1 bits (54), Expect = 47 
Identities = 6/9 (66%), Positives = 8/9 (88%) 

Query: 2 HIXNQHCWQ 10 

HI N++CWQ 
Sbjct: 118 HINNENCWQ 12 6 

> gi| 6Q5 6385|gbiAAF0284 9.1[AC00 98 94 20 Unknown protein [Arabidopsis thaliana] 
Length - -1209 

Score = 26.1 bits (54), Expect - 47 
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Identities - 11/25 (44%), Positives = 11/25 (44%), Gaps - 10/25 (40%) 

Query: 11 HLDDCCSHHC NHW--NHC 26 

HL CC HC N W NHC 

Sbjct': 1152 HLQHCCK-HCTTLMVSGNRWVCNHC 1175 



> gi I 104 33 911 I dbj I BAB14 057 . 1 j unnamed protein product [Homo sapiens] 
Length = 196 

Score = 26.1 bits (54), Expect = 47 
Identities - 6/9 (66%), Positives = 8/9 (88%) 

Query: 2 HIXNQHCWQ 10 

HI N++CWQ 
Sbjct: 118 HINNENCWQ 126 



> gii8393784 Iref |NP 059076.11 septin 9; MLL septin-like fusion; SL3-3 integration site 1 
[Mus 

musculus] 

gi 120347564 Iref |XP 109283.11 MLL septin-like fusion [Mus musculus] 

gi I 6165419 I emb | CAB5 98 33. 1 | septin-like protein Sintl [Mus musculus] 

gi ! 17 98 0 682 [ gb | AAL50685 . 1 | septin 9 [Mus musculus] 
Length - 334 

Score =26.1 bits (54), Expect = 47 
Identities - 6/9 (66%), Positives = 8/9 (88%) 

Query: 2 HIXNQHCWQ 10 

HI N++CWQ 
Sbjct: 118 HINNENCWQ 126 



> gi| 14 530105 iemb|CAC42221. II OVARIAN/Breast septin gamma [Homo sapiens] 
Length = 579 

Score - 26.1 bits (54), Expect = 47 
Identities = 6/9 (66%), Positives - 8/9 (88%) 

Query: 2 HIXNQHCWQ 10 

HI N++CWQ 
Sbjct: 362 HINNENCWQ 370 



> gi j 7512587 jpir I IT12519 hypothetical protein DKFZp4 34 F24 3 . 1 - human (fragment) 
gi I 52 62571 i emb j CAB4 572 8 . 1 | hypothetical protein [Homo sapiens] 
Length = 234 

Score = 26.1 bits (54), Expect - 47 
Identities = 6/9 (66%), Positives = 8/9 (88%) 

Query: 2 HIXNQHCWQ 10 

HI N++CWQ 
Sbjct: 17 HINNENCWQ 25 



11 



> gi [ 4 589626 I dbj I BAA76835 . 1 | KIAA0991 protein [Homo sapiens] 

gi I 6683815 Igbl AAF2337 3. 1 | MLL septin-like fusion protein MSF-B [Homo sapiens] 

gi I 14 5 30111 | emb | CAC4 222 4 . 1 | OVARIAN/Breast septin beta [Homo sapiens] 
Length - 422 

Score = 26.1 bits (54), Expect = 47 
Identities = 6/9 (66%), Positives - 8/9 (88%) 

Query: 2 HIXNQHCWQ 10 

HI N++CWQ 
Sbjct: 205 HINNENCWQ 213 



> giil8582Q88|ref |XP 090617.11 similar to Adenylate cyclase, type VII (ATP 
pyrophosphate-lyase ) 

(Adenylyl cyclase) ■ [Homo sapiens] 

Length - 180 

Score = 26.1 bits (54), Expect = 47 

Identities = 7/12 (58%), Positives = 7/12 (58%), Gaps - 2/12 (16%) 

Query: 7 HCWQHLDDCCSH 18 

CWQ DCCS 
Sbjct: 25 QCWQR--DCCSQ 34 



Score =18.0 bits (35), Expect = 12551 
Identities = 4/4 (100%), Positives = 4/4 (100%) 

Query: 5 NQHC 8 
NQHC 

Sbjct: 148 NQHC 151 



> gi| 60 90911|gb|AAF03391.1|AF17 38 99 1 septin-like protein SLP-b [Rattus norvegicus] 
Length = 479 

Score =26.1 bits (54), Expect - 47 
Identities = 6/9 (66%), Positives = 8/9 (88%) 

Query: 2 HIXNQHCWQ 10 

HI N++CWQ 
Sbjct: 264 HINNENCWQ 272 



> gi I 20892533 I ref |XP 156144.11 hypothetical protein XP_156144 [Mus musculus] 
Length = 213 

Score = 26.1 bits (54), Expect = 47 
Identities = 6/8 (75%), Positives - 7/8 (87%) 



Query: 5 NQHCWQHL 12 

NQ CW+HL 
Sbjct: 4 0 NQRCWEHL 4 7 
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> gi i 23058070 I qbi ZP 00083175,11 hypothetical protein [Pseudomonas fluorescens] 
Length = 176 

Score - 25.7 bits (53), Expect = 63 

Identities = 8/15 (53%), Positives = 8/15 (53%), Gaps = 5/15 (33%) 

Query: 6 QHCWQHLDDCCSHHC 20 

Q CW CCSH C 

Sbjct: 164 QRCW CCSHCC 173 



> gi I 24 4 5 958 0 I gb | AAN34 381 . 1 | envelope glycoprotein [Human immunodeficiency virus 1] 
Length = 211 

Score = 25.7 bits (53), Expect = 63 

Identities = 7/11 (63%), Positives - 7/11 (63%), Gaps = 4/11 (36%) 

Query: 19 HCN HWNH 25 

HON HWNH 
Sbjct: 63 HCNISKGHWNH 73 



> gi I 20844494 I ref |XP 137952.11 similar to putative retrovirus-related gag protein [Mus 
musculus ] 

Length = 212 
Score = 25.7 bits (53),- Expect = 63 

Identities - 7/11 (63%), Positives = 8/11 (72%), Gaps = 1/11 (9%) 

Query: 2 HIXNQHCWQHL 12 

HI NQ CW+ L 
Sbjct: 31 HI-NQQCWERL 4 0 



> gi I 9629632 I ref jNP 044917.11 tegument protein/FGARAT [murid herpesvirus 4] 
gi i 2318002 | qb| AAB664 60. 1 | tegument protein/FGARAT [murid herpesvirus 4] 
gi I 6625 64 0 j gb| AAF19341 . 1 | AF1Q5037 73 75a [murid herpesvirus 4] 
Length = 1291 

Score = 25.7 bits (53), Expect = 63 
Identities - 6/6 (100%), Positives = 6/6 (100%) 

Query: 9 WQHLDD 14 

WQHLDD 
Sbjct: 1261 WQHLDD 1266 



> gi|23272552|qb j AAH3557 4 . 1 | chromosome 6 open reading frame 8 [Homo sapiens] 
Length = 293 

Score = 25.7 bits (53), Expect - 63 

Identities = 10/22 (45%), Positives = 12/22 (54%), Gaps = 10/22 (45%) 

Query: 7 HCWQHLD--DC CSHHC 20 

HC H+D +C CSHHC 
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Sbjct: 140 HC — HVDVDECRTSITLCSHHC 159 



> gi I 13449287 I ref I NP 085155.11 NG3 protein [Homo sapiens] 
qi I 18 41553 I qb | AAB4 7 4 94 . 1 | AAB4 7 4 94 NG3 [Homo sapiens] 
Length - 293 

Score = 25.7 bits (53), Expect = 63 

Identities - 10/22 (45%), Positives = 12/22 (54%), Gaps = 10/22 (45%) 

Query: 7 HCWQHLD — DC CSHHC 20 

HC H+D +C CSHHC 
Sbjct: 140 HC — HVDVDECRTSITLCSHHC 159 



> qi i 16924 037 I qb j AAL3164 9 . 1 | AGO 7 917 9 4 Putative o-methyltransf erase ZRP4 [Oryza sativa] 

qi|17Q47043| qb | AAL34 94 8 . 1 | AC07 9037 21 Putative to o-methyltransf erase ZRP4 [Oryza 
sativa] 

Length = 366 
Score = 25.2 bits (52), Expect = 85 

Identities = 7/9 (77%), Positives - 7/9 (77%), Gaps - 1/9 (11%) 

Query: 7 HCWQHLDDC 15 

HCWQ DDC 
Sbjct: 271 HCWQD-DDC 278 



> gi I 23472065 i qb I ZP 00127393.11 hypothetical protein [Pseudomonas syringae pv. syringae 
B728a] 

Length - 409 

Score = 25.2 bits (52), Expect - 85 
Identities = 6/7 (85%), Positives = 6/7 (85%) 

Query: 21 NHWNHCV 27 

NHW HCV 
Sbjct: 194 NHWYHCV 200 



> gi I 20908811 1 ref |XP 138742.11 similar to endogenous retroviral family W, env(C7), 
member 1 

(syncytin) ; envelope protein; syncytin [Homo sapiens] 
[Mus musculus] 
Length = 712 

Score = 25.2 bits (52), Expect - 85 

Identities = 12/26 (46%), Positives = 12/26 (46%), Gaps - 8/26 (30%) 

Query: 1 CHIXNQHCWQHLDDCCSHHCNHWNHC 2 6 

CH HC Q L D C HH HC 
Sbjct: 34 CH HHCLQPLLDSCHHH HC 51 



> gi I 15598375 I ref I NP 251869.11 conserved hypothetical protein [Pseudomonas aeruginosa 
PAOl] 
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gi I 11347983 j pir I IE83247 conserved hypothetical protein PA3179 [imported] - Pseudomonas 

aeruginosa (strain PAOl) 
gi I 994 92 97 i gb | AAG0 65 67 . 1 | AE004 7 42 3 conserved hypothetical protein [Pseudomonas 
aeruginosa PAOl] 

Length - 386 

Score = 25.2 bits (52), Expect = 85 
Identities = 6/7 (85%), Positives = 6/7 (85%) 

Query: 21 NHWNHCV 27 

NHW HCV 
Sbjct: 191 NHWYHCV 197 



> gi i 23058698 Igbj ZP 00083733.11 hypothetical protein [Pseudomonas fluorescens] 
Length = 448 

Score = 25.2 bits (52), Expect = 85 
Identities - 6/7 (85%), Positives = 6/7 (85%) 

Query: 21 NHWNHCV 27 

NHW HCV 
Sbjct: 237 NHWYHCV 243 



> gi|20468866|ref |XP 117141.11 hypothetical protein XP_117141 [Homo sapiens] 
Length = 169 

Score = 25.2 bits (52), Expect = 85 

Identities - 8/13 (61%), Positives = 10/13 (76%), Gaps = 1/13 (7%) 

Query: 15 CC-SHHCNHWNHC 26 

CC SHH +H +HC 
Sbjct: 128 CCLSHHRHHRSHC 140 



> gii 15983410|gb{AAL1157 3.1|AF4 2 4 57 9 1 AT4gl 657 0 /dl4 310w [Arabidopsis thaliana] 
gi I 2 3308 4 05 | gb|AAN18172 . 1 | At4gl6570/dl4310w [Arabidopsis thaliana] 
Length = 457 

Score - 24.8 bits (51), Expect = 114 
Identities - 5/8 (62%), Positives - 6/8 (75%) 

Query: 20 CNHWNHCV 27 

C+HW CV 
Sbjct: 82 CDHWKQCV 8 9 
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Number of sequences in database: 1,242,768 

Lambda K H 

0.359 0.268 2.40 

Gapped 

Lambda K H 

0.294 0.110 0.610 



Matrix: PAM30 

Gap Penalties: Existence: 9, Extension: 1 

Number of Hits to DB: 17,703,379 

Number of Sequences: 1242768 

Number of extensions: 331126 

Number of successful extensions: 8258 

Number of sequences better than 20000.0: 8042 

Number of HSP's better than 20000.0 without gapping: 8050 

Number of HSP's successfully gapped in prelim test: 0 

Number of HSP's that attempted gapping in prelim test: 0 

Number of HSP's gapped (non-prelim): 8242 

length of query: 27 

length of database: 395,571,179 

effective HSP length: 18 

effective length of query: 9 

effective length of database: 373,201,355 

effective search space: 3358812195 

effective search space used: 3358812195 



T: 


11 








A: 


40 








XI 


14 


( 7 


3 


bits) 


X2 


35 


(14 


8 


bits) 


X3 


58 


(24 


6 


bits) 


SI 


34 


(19 


5 


bits) 


82 


34 


(17 


6 


bits) 



us 096668370GP1 




Creation date: 01-08-2003 
Indexing Officer: PHOENIX - PHOENIX 
Team: PhxAdmlnistrator 
Dossier: 09666837 



Legal Date: 28-04-2003 



No. 


Doccode 


Number of^aqes 


1 


CAD 





Total number of pages: 1 
Remarks: 



Order of re-scan issued on 



